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These notes were written to suit the contents of the course "Algebraic meth- 
ods" given at NTU from August to October 2009, 2010 and 2011. 

The main structure of the notes comes from the book by Robert Ash [1], a 
reference for this course. 

The discussions on permutations were inspired by the notes of Peter Hcn- 
drikus KrophoUer (http : / /www . maths . gla . ac . uk/~phk/3H_GRF_Chapter_2 . 
pdf). 

The proof of Jordan- Holder Theorem was inspired by the one given by Stuart 
Rankin (www . math . uwo . ca/~ srEinkin) . 

The presentation of the Primitive Element Theorem is based on the one given 
by Ken Brown (www.math.cornell.edu/~kbrown/4340/primitive.pdf). 

Many parts in the Chapter on Galois Theory are based on the book by 
Stewart [6]. 

For the history comments, they are taken from [2, 3, 4, 5]. 

Exercises have been collected during these past years from different sources. 
Are included a couple of exercises from Lam's book, and a couple of exercises 
from G. Berhuy. 

Finally, pictures are coming from Wikipedia. 
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Chapter A 

Group Theory 



Most lectures on group theory actually start with the definition of what is a 
group. It may be worth though spending a few lines to mention how mathe- 
maticians came up with such a concept. 

Around 1770, Lagrange initiated the study of permutations in connection 
with the study of the solution of equations. He was interested in understanding 
solutions of polynomials in several variables, and got this idea to study the be- 
haviour of polynomials when their roots are permuted. This led to what we now 
call Lagrange's Theorem, though it was stated as [5] // a function f{xi, . . . , Xn) 
of n variables is acted on by all n\ possible permutations of the variables and 
these permuted functions take on only r values, then r is a divisior ofn\. It is 
Galois (1811-1832) who is considered by many as the founder of group theory. 
He was the first to use the term "group" in a technical sense, though to him it 
meant a collection of permutations closed under multiplication. Galois theory 
will be discussed much later in these notes. Galois was also motivated by the 
solvability of polynomial equations of degree n. From 1815 to 1844, Cauchy 
started to look at permutations as an autonomous subject, and introduced the 
concept of permutations generated by certain elements, as well as several nota- 
tions still used today, such as the cyclic notation for permutations, the product 
of permutations, or the identity permutation. He proved what we call today 
Cauchy's Theorem, namely that if p is prime divisor of the cardinality of the 
group, then there exists a subgroup of cardinality p. In 1870, Jordan gathered all 
the applications of permutations he could find, from algebraic geometry, num- 
ber theory, function theory, and gave a unified presentation (including the work 
of Cauchy and Galois). Jordan made explicit the notions of homomorphism, 
isomorphism (still for permutation groups), he introduced solvable groups, and 
proved that the indices in two composition series are the same (now called 
Jordan-Holder Theorem). He also gave a proof that the alternating group An 
is simple for n > 4. 

In 1870, while working on number theory (more precisely, in generalizing 
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Kummer's work on cyclotomic fields to arbitrary fields) , Kronecker described in 
one of his papers a finite set of arbitrary elements on which he defined an abstract 
operation on them which satisfy certain laws, laws which now correspond to 
axioms for finite abelian groups. He used this definition to work with ideal 
classes. He also proved several results now known as theorems on abelian groups. 
Kronecker did not connect his definition with permutation groups, which was 
done in 1879 by Frobenius and Stickclbcrger. 

Apart permutation groups and number theory, a third occurence of group 
theory which is worth mentioning arose from geometry, and the work of Klein 
(we now use the term Klein group for one of the groups of order 4), and Lie, 
who studied transformation groups, that is transformations of geometric objects. 
The work by Lie is now a topic of study in itself, but Lie theory is beyond the 
scope of these notes. 

The abstract point of view in group theory emerged slowly. It took some- 
thing like one hundred years from Lagrange's work of 1770 for the abstract 
group concept to evolve. This was done by abstracting what was in commun to 
permutation groups, abelian groups, transformation groups... In 1854, Cayley 
gave the modern definition of group for the first time: 

"A set of symbols all of them different, and such that the product of any two of 
them (no matter in what order), or the product of any one of them into itself, 
belongs to the set, is said to be a group. These symbols are not in general con- 
vertible [commutative] , but are associative.^^ 

Let us start from there. 

1.1 Groups and subgroups 

We start by introducing the object that will interest us for the whole chapter. 

Definition 1.1. A group is a non-empty set G on which there is a binary 
operation (a, b) i— > ab such that 

• if a and b belong to G then ab is also in G (closure), 

• a{bc) = (ab)c for all a,b,c in G (associativity), 

• there is an element 1 £ G such that al = la = a for all a e G (identity), 

• if a e G, then there is an element a^^ G G such that aa^^ = a^^a = 1 
(inverse). 

One can easily check that this implies the unicity of the identity and of the 
inverse. 

A group G is called abelian if the binary operation is commutative, i.e., 
ab = ba for all a,b E G. 

Remark. There are two standard notations for the binary group operation: ei- 
ther the additive notation, that is (a, fe) n- a + 6 in which case the identity is 
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denoted by 0, or the multiplicative notation, that is (a, b) H> ab for which the 
identity is denoted by 1. 

Examples 1.1. 1. Z with the addition and as identity is an abelian group. 

2. Z with the multiplication is not a group since there are elements which 
are not invertible in Z. 

3. The set of n x ?i invertible matrices with real coefficients is a group for 
the matrix product and identity the matrix I„. It is denoted by G'L„(IR) 
and called the general linear group. It is not abelian for n > 2. 

The above examples are the easiest groups to think of. The theory of algebra 
however contains many examples of famous groups that one may discover, once 
equipped with more tools (for example, the Lie groups, the Brauer group, the 
Witt group, the Weyl group, the Picard group,... to name a few). 

Definition 1.2. The order of a group G, denoted by |G|, is the cardinality of 
G, that is the number of elements in G. 

We have only seen infinite groups so far. Let us look at some examples of 
finite groups. 

Examples 1.2. 1. The trivial group G = {0} may not be the most exciting 
group to look at, but still it is the only group of order 1. 

2. The group G = {0, 1, 2, . . . , 7i — 1} of integers modulo n is a group of order 
n. It is sometimes denoted by Z„ (this should not be confused with p-adic 
integers though!). 

Definition 1.3. A subgroup H oi a, group G is a non-empty subset of G that 
forms a group under the binary operation of G. 

Examples 1.3. 1. If we consider the group G = Z4 = {0, 1, 2, 3} of integers 
modulo A, H = {0, 2} is a subgroup of G. 

2. The set of n x n matrices with real coefficients and determinant of 1 is 
a subgroup of GL„(]R), denoted by S'L„(M) and called the special linear 
group. 

At this point, in order to claim that the above examples are actually sub- 
groups, one has to actually check the definition. The proposition below gives an 
easier criterion to decide whether a subset of a group G is actually a subgroup. 

Proposition 1.1. Let G be a group. Let H be a non-empty subset of G. The 
following are equivalent: 

1. H is a subgroup of G. 

2. (a) x,y € H implies xy € H for all x, y. 
(b) X E H implies x^^ G H . 
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3. x,y £ H implies xy~^ G H for all x, y. 
Proof. We prove that 1. ^ 3. ^ 2. ^ 1. 

1. => 3. This part is clear from the definition of subgroup. 

3. => 2. Since H is non-empty, let x £ H. By assumption of 3., we have that 
xx~^ = 1 G H and that Ix^^ e H thus x is invertible in H. We now 
know that for x,y G H , x and y^^ are in H, thus x{y~^)~^ = xy is in H. 

2. 1. To prove this direction, we need to check the definition of group. Since 

closure and existence of an inverse are true by assumption of 2., and 
that associativity follows from the associativity in G, we are left with the 
existence of an identity. Now, if x G H, then x~^ G H hy assumption of 
2., and thus xx~^ ^ 1 G H again by assumption of 2., which completes 
the proof. 

□ 

We will often use the last equivalence to check that a subset of a group G is 
a subgroup. 

Now that wc have these structures of groups and subgroups, let us intro- 
duce a map that allows to go from one group to another and that respects the 
respective group operations. 

Definition 1.4. Given two groups G and H , a group homomorphism is a map 
f -.G^ H such that 

f{.xy) = f{x)f{y) for all x,y gG. 

Note that this definition immediately implies that the identity Ifj of G is 
mapped to the identity 1h of H. The same is true for the inverse, that is 
fix-') = f{x)-\ 

Example 1.4. The map exp : (M, (M*, •), x H' exp(a;) is a group homo- 
morphism. 

Definition 1.5. Two groups G and H are isomorphic if there is a group homo- 
morphism f : G ^ H which is also a bijection. 

Roughly speaking, isomorphic groups are "essentially the same" . 

Example 1.5. If we consider again the group G = Z4 = {0, 1, 2, 3} of integers 
modulo 4 with subgroup H = {0,2}, we have that H is isomorphic to Z2, the 
group of integers modulo 2. 

A crucial definition is the definition of the order of a group element. 

Definition 1.6. The order of an element a e G is the least positive integer n 
such that a" = 1. If no such integer exists, the order of a is infinite. We denote 
it by \a\. 
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Note that the critical part of this definition is that the order is the least 
positive integer with the given property. The terminology order is used both for 
groups and group elements, but it is usually clear from the context which one 
is considered. 

1.2 Cyclic groups 

Let us now introduce a first family of groups, the cyclic groups. 

Definition 1.7. A group G is cyclic if it is generated by a single element, which 
wc denote by G = (a). We may denote by C„ a cyclic group of n elements. 

Example 1.6. A finite cyclic group generated by a is necessarily abelian, and 
can be written (multiplicativcly) 

{l,a,a2,...,a"-i} with a" = 1 

or (additively) 

{0, a, 2a, . . . , (n — l)a} with na = 0. 

A finite cyclic group with n elements is isomorphic to the additive group Z„ of 
integers modulo n. 

Example 1.7. An nth root of unity is a complex number z which satisfies the 
equation z" = 1 for some positive integer n. Let C„ = e^"/" be an nth root 
of unity. All the nth roots of unity form a group under multiplication. It is 
a cyclic group, generated by (n, which is called a primitive root of unity. The 
term "primitive" exactly refers to being a generator of the cyclic group, namely, 
an nth root of unity is primitive when there is no positive integer k smaller than 
n such that Cn = 1- 

Here are some properties of cyclic groups and its generators. 

Proposition 1.2. IfG is a cyclic group of order n generated by a, the following 
conditions are equivalent: 

1. \a^\ = n. 

2. k and n are relatively prime. 

3. k has an inverse modulo n, that is there exists an integer s such that 
ks = 1 modulo n. 

Proof. Before starting the proof, recall that since a generates G of order n, we 
have that the order of a is n and in particular a" = 1. The fact that \a''\ = n 
means in words that the order of a'' is also n, that is, a'^ is also a generator of G. 
We first prove that 1. 2., while 2. 3. follows from Bezout identity. 
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1. 2. Suppose by contradiction that k and n are not relatively prime, that is, 

there exists s > 1 such that s\k and s\n. Thus n = ms and k = sr for 
some m, r > 1 and we have 

Now since s > 1, m < n, which contradicts that n is the order of a^. 

2. 1. Suppose that the order of a*^ is not n, then there exists m < n such that 

(a'^)™ = 1 and thus n\km since n is the order of a. If and ?i were to 
be relatively prime, then n would divide m, which is a contradiction since 
m < n. 

2. 3. If A: and n are relatively prime, then by Bezout identity, there exist r, s 

such that 1 = fcr + ns and thus kr = 1 modulo n. 

3. ^2. If fcr = 1 modulo n then 1 = fcr + n.s for some s and the greatest common 

divisor of k and ?7 must divide 1, which shows k and n are relatively prime. 

□ 

Corollary 1.3. The set of invertible elements modulo n forms a group un- 
der multiplication, whose order is the Euler function (p{n), which by definition 
counts the number of positive integers less than n that are relatively prime to n. 

Example 1.8. Consider the group Zg ~ {0,1,2,3,4,5}, the group Zg of in- 
vertible elements in Zg is = {1, 5}. We have that Lp{6) = (p{2)(p{3) = 2. 

1.3 Cosets and Lagrange's Theorem 

Definition 1.8. Let H he a subgroup of a group G. If g £ G, the right coset 
of H generated by g is 

Hg = {kg, heH} 
and similarly the left coset of H generated by g is 

gH^{gh, heH}. 

In additive notation, we get H + g (which usually implies that we deal with 
a commutative group where we do not need to distinguish left and right cosets) . 

Example 1.9. If we consider the group Z4 = {0,1,2,3} and its subgroup 
H = {0, 2} which is isomorphic to Z2, the cosets of i7 in G are 

+ H = H, 1 + H = {1,3}, 2 + H = H, 3 + iJ = {l,3}. 

Clearly + H ^ 2 + H and 1 + H = 3 + H . 

We see in the above example that while an element of g e G runs through 
all possible elements of the group G, some of the left cosets gH (or right cosets 
Hg) may be the same. It is easy to see when this exactly happens. 
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Lemma 1.4. We have that Ha = Hb if and only if ab ^ E H for a,b E G. 

Similarly, aH = bH if and only if a^^b £ H for a,b £ G. 

Proof. If two right cosets are the same, that is Ha = Hb, since H \s a, subgroup, 
we have 1 G H and a = hb for some h £ H, so ab^^ = h G H. 

Conversely, if ab^^ = h E H, then Ha = Hhb = Hb, again since H is a 
subgroup. □ 

While one may be tempted to define a coset with a subset of G which is not 
a subgroup, we sec that the above characterization really relies on the fact that 
H is actually a subgroup. 

Example 1.10. It is thus no surprise that in the above example we have Q+H = 
2 + H and 1 + H = 3 + H, since we have modulo 4 that — 2 = 2 G i? and 
1-3 = 2 e H. 

Saying that two elements a,b £ G generate the same coset is actually an 
equivalence relation in the following sense. We say that a is equivalent to b 
if and only if ab~^ £ H, and this relation satisfies the three properties of an 
equivalence relation: 

• reflexivity: aa^^ = 1 £ H. 

• symmetry: if ab^^ £ H then (ab^^)^^ = ba^^ £ H. 

• transitivity: if ab^^ £ H and bc^^ £ H then {ab^^)(bc^^) ~ ac^^ £ H. 

The equivalence class of a is the set of elements in G which are equivalent 
to o, namely 

{b, ab-^ £ H}. 

Since ab^^ £ H '^==^ (ab^^)^^ = ba^^ £ H -^=> b £ Ha, we further have that 

{b, ab~^ eH} = Ha, 
and a coset is actually an equivalence class. 

Example 1.11. Let us get back to our example with the group Z4 = {0, 1, 2, 3} 
and its subgroup H = {0, 2}. We compute the first coset + H = H, and thus 
we now know that the equivalence class of is H, and thus there is no need to 
compute the coset generated by 2, since it will give the same coset. We then 
compute the coset 1 + H = {1,3} and again there is no need to compute the 
one of 3 since it is already in the coset of 1. We thus get 2 cosets, and clearly 
they partition Z4: 

Z4 = {0, 2} U {1, 3} = U (1 + H). 

It is important to notice that the right (resp. left) cosets partition the group 
G (that the union of all cosets is G is clear since we run through all elements 
of G and H contains 1, and it is easy to see that if a; G Ha and x £ Hb then 
Ha = Hb). 
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Example 1.12. Consider R as an additive group with subgroup Z. Every real 
number up to addition by an integer looks like a number in [0, 1). Thus 

K = Uo<^<i(a; + Z), 

and the cosets of Z partition M. 

Furthermore, since the map h i— ^ ha, /i € 77, is a one-to-one correspondence, 
each coset has \H\ elements. 

Definition 1.9. The index of a subgroup H in G is the number of right (left) 
cosets. It is a positive number or oo and is denoted by [G : H], 

If we think of a group G as being partitioned by cosets of a subgroup H, 
then the index of H tells how many times we have to translate H to cover the 
whole group. 

Example 1.13. In Example 1.12, the index [R : Z] is infinite, since there are 
infinitely many cosets of Z in M. 

Theorem 1.5. (Lagrange's Theorem). If H is a subgroup of G, then \G\ = 
|-ff|[G : H]. In particular, if G is finite then \H\ divides \G\ and [G : H\ ^ 
\G\I\H\. 

Proof. Let us start by recalling that the left cosets of H forms a partition of G, 
that is 

G = UgH, 

where g runs through a set of representatives (one for each coset). Let us look 
at the cardinality of G: 

\G\^\UgH\^Y.\9H\ 

since we have a disjoint union of cosets, and the sum is again over the set of 
representatives. Now 

since we have already noted that each coset contains \H\ elements. Wc then 
conclude that 

|G|=El^l = [^^^]l^l- 

□ 

Example 1.14. Consider G = Z, iJ = 3Z, then [G : iJ] = 3. 

Of course, Lagrange did not prove Lagrange's theorem! The modern way 
of defining groups did not exist yet at his time. Lagrange was interested in 
polynomial equations, and in understanding the existence and nature of the 
roots (does every equation has a root? how many roots?...). What he actually 
proved was that if a polynomial in n variables has its variables permuted in 
all n\ ways, the number of different polynomials that are obtained is always a 
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Figure 1.1: Joseph-Louis Lagrange (1736-1813) 



factor of n\. Since all the permutations of n elements are actually a group, the 
number of such polynomials is actually the index in the group of permutations 
of n elements of the subgroup H of permutations which preserve the polynomial. 
So the size of H divides n!, which is exactly the number of all permutations of 
n elements. This is indeed a particular case of what we call now Lagrange's 
Theorem. 

Corollary 1.6. 1. Let G be a finite group. If a (z G, then \a\ divides \G\. In 
particular, a'*^' = 1. 

2. If G has prime order, then G is cyclic. 

Proof. 1. If a G G has order say m, then the subgroup H = {1, a, . . . , a'"~^} 
is a cyclic subgroup of G with order \II\ = m. Thus m divides \G\ by the 
theorem. 

2. Since \G\ is prime, we may take a 7^ 1 in G, and since the order of a has 
to divide |G|, we have \a\ = \G\. Thus the cyclic group generated by a 
coincides with G. 

□ 

Example 1.15. Using Lagrange's Theorem and its corollaries, we can already 
determine the groups of order from 1 to 5, up to isomorphism (see Table 1.1). 
If |G| is prime, we now know that G is cyclic. 

Let us look at the case where G is of order 4. Let g G G. We know that 
the order of g is either 1,2 or 4. If the order of g is 1, this is the identity. If G 
contains an element g of order 4, then that means that g generates the whole 
group, thus G is cyclic. If now G does not contain an element of order 4, then 
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\G\ G 

1 {1} 

2 C2 

3 C3 

4 C4, C2 X C2 

5 C5 



Tabic 1.1: Groups of order from 1 to 5. C„ denotes the cyclic group of order n. 

apart the identity, all the elements have order 2. From there, it is easy to obtain 
a multiplication table for G, and see that it coincides with the one of the group 

Z2 X Z2 = {{x,y) \x,ye Z2} 

with binary operation {x,y) + {x' ,y') = {x + x' ,y + y'). This group is called the 
Klein group, and it has several interpretations, for example, it is the group of 
isometrics fixing a rectangle. 

Remark. The above example also shows that the converse of Lagrange's Theo- 
rem is not true. If we take the group G = C2 x C2, then 4 divides the order of 
G, however there is no element of order 4 in G. 

Once Lagrange's Theorem and its corollaries are proven, we can easily deduce 
Euler's and Fermat's Theorem. 

Theorem 1.7. (Euler's Theorem). If a and n are relatively prime positive 
integers, with n > 2, then 

a-^^") = 1 mod n. 

Proof. Since a and n are relatively prime, we know from Proposition 1.2 that 
a has an inverse modulo n, and by its corollary that the group of invertible 
elements has order (p{n). Thus 

a'^^"^ = 1 mod n 

by Lagrange's Theorem first corollary. □ 

Corollary 1.8. (Fermat's Little Theorem). If p is a prime and a is a 
positive integer not divisible by p, then 

a^~^ = 1 mod p. 

This is particular case of Euler's Theorem when n is a prime, since then 
ip{n) =_p- 1. 
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1.4 Normal subgroups and quotient group 

Given a group G and a subgroup H, we have seen how to define the cosets of H, 
and thanks to Lagrange's Tlieoreni, we already Icnow that tlie number of cosets 
[G : H] is related to the order of H and G by |G| = \H\[G : H]. A priori, the 
set of cosets of H has no structure. We are now interested in a criterion on H 
to give the set of its cosets a structure of group. 

In what follows, we may write H < G for iJ is a subgroup of G. 

Definition 1.10. Let G be a group and H < G. We say that H is a. normal 
subgroup of G, or that H is normal in G, if we have 

cHc^^ = H, for all c G G. 

We denote it H <G, or H <l G when we want to emphasize that H is a proper 
subgroup of G. 

The condition for a subgroup to be normal can be stated in many slightly 
different ways. 

Lemma 1.9. Let H <G. The following are equivalent: 

1. cHcr^ C H for all ceG. 

2. cHcr^ = H for all ceG, that is cH = He for allceG. 

3. Every left coset of H in G is also a right coset (and vice-versa, every right 
coset of H in G is also a left coset). 

Proof. Clearly 2. implies 1., now cHc^^ C H for all c e G if and only if 
cH C He. Let x G i?c, that is x = he for some h H, so that 

X = {cc^^)hc = c{c^^he) = eh' 

for some h' £ H since eHe~^ C H for all c and thus in particular for c~^. This 
shows that He is included in eH or cquivalently that H C eHe^^. 

Also 2. clearly implies 3. Now suppose that eH = Hd. This means that 
c belongs to Hd by assumption and to He by definition, which means that 
Hd = He. □ 

Example 1.16. Let GL„(M) be the group of n x n real invertible matrices, and 
let SLn{M.) be the subgroup formed by matrices whose determinant is 1. Let us 
see that S'i„(M) < GL„(R). 

For that, we have to check that ABA^^ e SLn{R) for all B € 5i„(K) and 
A £ GL„(R). This is clearly true since 

det{ABA-^) = dct(S) = 1. 

Proposition 1.10. If H is normal in G, then the cosets of H form a group. 
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Proof. Let us first define a binary operation on the cosets: {aH, bH) i— >■ {aH){bH) = 
{{ah){bh'), ah € aH, bh' £ bH}. We need to check tliat the definition of group 
is satisfied. 

• closure. This is the part whiclr asks a little bit of work. Since cH = He 
for all c G G, then 

{aH){bH) = a{Hb)H = a{bH)H = abHH = abH. 

Note that this product docs not depend on the choice of representatives. 

• Associativity conies from G being associative. 

• The identity is given by the coset IH = H . 

• The inverse of the coset aH is a^^H . 

□ 

Definition 1.11. The group of cosets of a normal subgroup iV of G is called 
the quotient group of G by N . It is denoted by G/N. 

Let us finish this section by discussing the connection between normal sub- 
groups and homomorphisms. The first normal subgroup of interest will be the 
kernel of a group homomorphism. 

Recall that if / : G — !■ i7 is a group homomorphism, the kernel of / is defined 

by 

Ker(/) = {a e G, f{a) = 1}. 
It is easy to see that Ker(/) is a normal subgroup of G, since 

fiaba-') = f{a)f{b)f{a)-' = = 1 

for all b £ Ker(/) and all a e G. 
The converse is more interesting. 

Proposition 1.11. Let G be a group. Every normal subgroup of G is the kernel 
of a homomorphism. 

Proof. Suppose that N < G and consider the map 

TT : G ^ G/N, a ^ aN. 

To prove the result, we have to show that tt is a group homomorphism whose 
kernel is N. First note that tt is indeed a map from group to group since G/N 
is a group by assuming that N is normal. Then we have that 

n{ab) = abN = {aN){bN) = T:{a)n{b) 

where the second equality comes from the group structure of G/N. Finally 

Kcr(7r) = {a e G I n{a) = N} = {a £ G \ aN = N} = N. 

□ 
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Definition 1.12. Let N <G. The group homomorphism 

TT : G ^ G/N, aN 
is called the natural or canonical map or projection. 

Recall for further usage that for / a group homomorphism, we have the 
following characterization of injectivity: a homomorphism / is injective if and 
only if its kernel is trivial (that is, contains only the identity element). Indeed, 
if / is injective, then Ker(/) = {a, f{a) = 1} = {1} since /(I) = 1. Conversely, 
if Ker(/) ~ {1} and we assume that /(a) = f{b), then 

fiab-') = .f{a)f{b)-' = .f{a)f{a)-' = 1 

and ab^^ = 1 implying that a = b and thus / is injective. 

Terminology. 

monomorphism=injective homomorphism 

epimorphism=surjective homomorphism 

isomorphism=bijective homomorphism 

endomorphism=homomorphism of a group to itself 

automorphism=isomorphism of a group with itself 

We have looked so far at the particular subgroup of G which is its kernel. 
The proposition below describes more generally subgroups of G and H. 

Proposition 1.12. Let f : G ^ H be a homomorphism. 

L If K is a subgroup of G, then f{K) is a subgroup of H . If f is an epi- 
morphism and K is normal, then f{K) is normal. 

2. If K is a subgroup of H , then f^^[K) = {x G G, f{x) G A'} is a subgroup 
of G. If K is normal, so is f^^{K). 

Proof. 1. To prove that f{K) is a subgroup of H , it is enough to show that 
f{a)f{b)^^ e f{K) by Proposition 1.1, which is clear from 

f{a)f{b)-^ = f{ab-') e f{K). 

If K is normal, we have to show that cf{K)c'^ = f{K) for all c E H. 
Since / is an epimorphism, there exists d G G such that f{d) — c, so that 

cf{K)c-' = f{d)f{K)f{d)-' = fidKd-') = f{K) 

using that K is normal. 
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2. As before, to prove that f~^{K) is a subgroup of G, it is enough to 
showing that ab~^ £ f~^{K) for a,b £ f^^{K), which is equivalent to 
show that f{ab^^) E K. This is now true since f{ab^^) = 
with a,b E f^^{K) and K a subgroup. 

For the second claim, we have to show that 
or equivalently 

f{c,r\K)c-') = K, c€G. 
For c S G and a € f~^{K), then 

/(cac-i) = /(c)/(a)/(c)-i e K 

since K is normal. 

□ 



1.5 The isomorphism theorems 

This section presents different isomorphism theorems which are important tools 
for proving further results. The first isomorphism theorem, that will be the 
second theorem to be proven after the factor theorem, is easier to motivate, 
since it will help us in computing quotient groups. 

But let us first start with the so-called factor theorem. Assume that we 
have a group G which contains a normal subgroup N , another group H, and 
/ : G — > iJ a group homomorphism. Let tt be the canonical projection (see 
Definition 1.12) from G to the quotient group G/N: 

G -^-^ H 




t 

G/N 

We would like to find a homomorphism f : G/N ^ H that makes the diagram 
commute, namely 

/(a) - /Xa)) 

for all a E G. 

Theorem 1.13. (Factor Theorem). Any homomorphism / whose kernel K 
contains N can be factored through G/N . In other words, there is a unique 
homomorphism f : G/N H such that / o tt = /. Furthermore 

1. f is an epimorphism if and only if f is. 

2. f is a monomorphism if and only if K = N . 
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3. f is an isomorphism if and only if f is an epimorphism and K = N . 

Proof. Unicity. Let us start by proving that if there exists / such that /ott = /, 
then it is unique. Let / be another homomorphism such that / o tt = /. We 
thus have that 

{foir){a) = {foir){a)^f{a) 

for aU a G G, that is 

f{aN) = f{aN) = f{a). 

This tells us that for all 67V € G/N for which there exists an element 6 in G 
such that 7r(6) = &7V, then its image by either / or / is determined by f{h). 
This shows that / = / by surjectivity of tt. 

Existence. Let aN e G/N such that 7r(a) = aN for a e G. We define 

fiaN) = fia). 

This is the most natural way to do it, however, we need to make sure that this 
is indeed well-defined, in the sense that it should not depend on the choice of 
the representative taken in the coset. Let us thus take another representative, 
say b £ aN. Since a and b arc in the same coset, they satisfy a'^b G N C K, 
where K = Ker(/) by assumption. Since a^^b G if, we have f{a~^b) = 1 and 
thus /(a) = f{b). 

Now that / is well defined, let us check this is indeed a group homomorphism. 
First note that G/N is indeed a group since N <G. Then, we have 

fiaNbN) = fiabN) = f{ab) = f{a)f{b) = f{aN)f{bN) 

and / is a homomorphism. 

1. The fact that / is an epimorphism if and only if / is comes from the fact 
that both maps have the same image. 

2. First note that the statement / is a monomorphism if and only if K = N 
makes sense since K = Ker(/) is indeed a normal subgroup, as proved 
earlier. 

To show that / is a monomorphism is equivalent to show that Ker(/) is 
trivial. By definition, wc have 

Ker(/) = {aN e G/N, f{aN) = 1} 

= {aN e G/N, /Xa)) = /(a) = 1} 
= {aN e G/N, aeK = Kcr(/)}. 

So the kernel of / is exactly those cosets of the form aN with a E K, but 
for the kernel to be trivial, we need it to be equal to N , that is we need 
K = N. 

3. This is just a combination of the first two parts. 
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□ 

We are now ready to state the first isomorphism theorem. 

Theorem 1.14. (1st Isomorphism Theorem). If f : G ^ H is a homo- 
morphism with kernel K, then the image of f is isomorphic to G/K: 

Im(/) ~ G/Ker(/). 

Proof. We know from the Faetor Theorem that 

/ : G/Ker(/) ^ H 

is an isomorphism if and only if / is an epimorphism, and clearly / is an epi- 
morphism on its image, which concludes the proof. □ 

Example 1.17. We have seen in Example 1.16 that 5'L„(M) O GL„(M). Con- 
sider the map 

det : GL„(M) ^ (M*,-)> 

which is a group homomorphism. We have that Ker(det) = S'L„(M). The 1st 
Isomorphism Theorem tells that 

Im(det) ~ GL„(M)/S'L„(R). 

It is clear that det is surjective, since for all a g M*, one can take the diagonal 
matrix with all entries at 1, but one which is a. Thus we conclude that 

M* ~ GL„(M)/5L„(M). 

The 1st Isomorphism Theorem can be nicely illustrated in terms of exact 
sequences. 

Definition 1.13. Let F, G, H, I, . . . be groups, and let f,g,h,... be group ho- 
momorphisms. Consider the following sequence: 

F ^^G H I 

We say that this sequence is exact in one point (say G) if Im(/) = Ker((;). A 
sequence is exact if it is exact in all points. 

A short exact sequence of groups is of the form 

F ^-^G H ^-^l 

where i is the inclusion and j is the constant map 1. 
Proposition 1.15. Let 

1 F — ^ G H — ^ 1 
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be a short exact sequence of groups. Then Im(/) is normal in G and we have a 
group isomorphism 

G/Im(/) ^ H, 

or equivalently 

G/Kev{g) ~ H. 

Proof. Since the sequence is exact, we have that Im(/) = Ker(g) thus Im(/) is 
a normal subgroup of G. By the first Isomorphism Theorem, we have that 

G/Ker(g) ~ Im(.g) = H. 

since Im(g) = Kcr(j) = H. □ 

The formulation in terms of exact sequences is useful to know, since it hap- 
pens very often in the literature that an exact sequence is given exactly to be 
able to compute such quotient groups. 

Let us state the second and third isomorphism theorem. 

Theorem 1.16. (2nd Isomorphism Theorem). If H and N are subgroups 
of G, with N normal in G, then 

H/{Hr\N) ~ HN/N. 

There are many things to discuss about the statement of this theorem. 

• First we need to check that HN is indeed a subgroup of G. To show that, 
notice that HN ~ NH since iV is a normal subgroup of G. This implies 
that for hn G HN, its inverse {hn)~^ ~ n^^h"^ G G actually lives in HN, 
and so does the product {hn){h'n') — h{nh')n' . 

• Note that by writing HN/N, wc insist on the fact that there is no reason 
for A*" to be a subgroup of H. On the other hand, is a normal subgroup 
of HN, since for all hn e HN, we have 

hnNn^^h-^ = hNh-^ C N 

since N is normal in G. 

• We now know that the right hand side of the isomorphism is a quotient 
group. In order to see that so is the left hand side, we need to show that 
iJ n is a normal subgroup of H. This comes by noticing that H O N is 
the kernel of the canonical map tt : G — > G/N restricted to H. 

Now that all these remarks have been done, it is not difficult to see that the 
2nd Isomorphism Theorem follows from the 1st Isomorphism Theorem, as does 
the 3rd Isomorphism Theorem. 

Theorem 1.17. (3rd Isomorphism Theorem). If N and H are normal 
subgroups of G, with N contained in H , then 



G/Hc{G/N)/{H/N). 
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1.6 Direct and semi-direct products 

So far, we have seen how given a group G, wc can get smaller groups, such as 
subgroups of G or quotient groups. Wc will now do the other way round, that 
is, starting with a collection of groups, we want to build larger new groups. 

Let us start with two groups H and K, and let G ~ H x K he the cartesian 
product of H and A', that is 

G = {(/i,fc), heH, keK}. 

We define a binary operation on this set by doing componentwise multiplication 
(or addition if the binary operations of H and K are denoted additively) on G: 

{hi,ki){h2,k2) = {hih2,kik2) e H x K. 

Clearly G is closed under multiplication, its operation is associative (since both 
operations on H and K are), it has an identity element given by 1g = {1h, ^k) 
and the inverse of (/i, k) is {h~^, k~^). In summary, G is a group. 

Definition 1.14. Let H, K be two groups. The group G = H x K with binary 
operation defined componentwise as described above is called the external direct 
product of H and K. 

Examples 1.18. 1. Let Z2 be the group of integers modulo 2. We can 
build a direct product of Z2 with itself, namely Z2 x Z2 with additive law 
componentwise. This is actually the Klein group, also written G2 x G2. 
This group is not isomorphic to Z4! 

2. Let Z2 be the group of integers modulo 2, and Z3 be the group of integers 
modulo 3. We can build a direct product of Z2 and Z3, namely Z2 x Z3 
with additive law componentwise. This group is actually isomorphic to 
Zg! 

3. The group (M, +) x (K, +) with componentwise addition is a direct product. 

Note that G contains isomorphic copies H and K of respectively H and K, 
given by 

H = {(/i, Ik), h eH}, K = {(Iff, fc), k e K}, 

which furthermore are normal subgroups of G. Let us for example see that H 
is normal in G. By definition, we need to check that 

{h,k)H{h-\k-^) C {h,k)eG. 

Let {h! , Ik) G 5, we compute that 

[h, k){h', ik){h-\k-') = {hh'h-\ ife) e 

since hh'h^^ E H . The same computation holds for K . 

If we gather what we know about G, H and K, we get that 
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• by definition, G = HK and H D K = {la}, 

• by what we have just proved, H and K are two normal subgroups of G. 

This motivates the following definition. 

Definition 1.15. If a group G contains normal subgroups H and K such that 
G = HK and H O K ^ {1g}i we say that G is the internal direct product of H 
and K. 

Examples 1.19. 1. Consider the Klein group Z2 x Z2, it contains the two 
subgroups H = {(^,0), h S Z2} and K = {(0,fc), k e Z2}. We have 
that both H and K are normal, because the Klein group is commutative. 
We also have that H O K = {(0,0)}, so the Klein group is indeed an 
internal direct product. On the other hand, Z4 only contains as subgroup 
= {0, 2}, so it is not an internal direct product! 

2. Consider the group Z2XZ3, it contains the two subgroups H = {{h, 0), h £ 
Z2} and K = {(0, k), k e Z3}. We have that both H and K are normal, 
because the group is commutative. We also have that H O K = {(0,0)}, 
so this group is indeed an internal direct product. Also Zg contains the 
two subgroups H = {0,3} ~ Z2 and K = {0,2,4} ~ Z3. We have that 
both H and K are normal, because the group is commutative. We also 
have that HOK = {0}, so this group is indeed an internal direct product, 
namely the internal product of Z2 and Z3. This is in fact showing that 
Ze ~ Z2 X Z3. 

The next result makes explicit the connection between internal and external 
products. 

Proposition 1.18. If G is the internal direct product of H and K, then G is 
isomorphic to the external direct product H x K . 

Proof. To show that G is isomorphic to H x K, wc define the following map 

f : H X K ^ G, f{h, k) = hk. 

First remark that ii h £ H and k £ K, then hk = kh. Indeed, we have using 
that both K and H are normal that 

{hkh-^)k'^ e K, h{kh~^k-^) e H 

implying that 

hkh-^k-^ e KnH = {!}. 
We are now ready to prove that / is a group isomorphism. 
1. This is a group homomorphism since 

/((/i, k)ih', k')) = /(M', kk') = h{h'k)k' = h{kh')k' ^ f{h, k)f{h\ k'). 
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2. The map / is injective. This can be seen by checking that its kernel is 
trivial. Indeed, ii f{h,k) = 1 then 

hk^ 1 ^ h = k-^ ^ he K ^ he HCiK = {!}. 

We have then that /i = fc = 1 which proves that the kernel is {(1,1)}. 

3. The map / is surjectivc since by definition G = HK. 

□ 

Note that the definitions of external and internal product are surely not re- 
stricted to two groups. One can in general define them for n groups Hi, . . . , H„. 
Namely 

Definition 1.16. li Hi, . . . , Hn are arbitrary groups, the external direct prod- 
uct of Hi , . . . , Hn is the cartesian product 

G = Hi X H2 X ■■■ X Hn 

with componentwise multiplication. 

If G contains normal subgroups Hi, ... , Hn such that G = Hi ■ ■ ■ Hn and 
each g can be represented as hi ■ ■ ■ hn uniquely, we say that G is the internal 
direct product of Hi , . . . , Hn . 

We can see a slight difference in the definition of internal product, since 
in the case of two subgroups, the condition given was not that each g can 
be represented uniquely as hih2, but instead that the intersection of the two 
subgroups is {!}. The next proposition shows the connection between these two 
points of view. 

Proposition 1.19. Suppose that G = Hi---Hn where each Hi is a normal 
subgroup of G. The following conditions are equivalent. 

1. G is the internal direct product of the Hi. 

2. H1H2 ■ ■ ■ H,-i DH,^ {!}, for all i = 1, . . . ,n. 
Proof. Let us prove 1. 2. 

1. 2. Let us assume that G is the internal direct product of the Hi, which means 

that every element in G can be written uniquely as a product of elements 
in Hi. Now let us take g G H1H2 ■ ■ ■ O Hi = {1}. We have that g € 
H1H2 ■ ■ ■ Hi^i, which is uniquely written as g ~ /11/12 ■ ■ • /li-ils; • ■ ■ 
hj e Hj. On the other hand, g € Hi thus g = Ihi ■ ■ ■ lHi_i<7 and by 
unicity of the representation, we have hj = 1 for all j and g = 1. 

2. 1. Conversely, let us assume that g £ G can be written either 



g = /11/12 • • • hn, hj e Hj, 
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or 

g = kik2 ■ ■ ■ k„, k.j € Hj. 
Recall that since all Hj are normal subgroups, then 

hihj = hjh,, G Hi, hj G Hj. 

(If you cannot recall the argument, check out the proof of Proposition 
1.18). This means that we can do the following manipulations: 

hih2 ■ ■ ■ hn ~ kik2 ■ ■ ■ kn 
h2- ■ ■ hn ^ (h^^ki) k2- ■ ■ kn 

hs- ■ ■ hn = {h];^ki){h^^k2)k3- ■ ■ kn 

and so on and so forth till we reach 

h„kn^ = (h^^ki) ■ ■ ■ (ft.,7iifc„_i). 

Since the left hand side belongs to iJ„ while the right hand side belongs 
to i^i • • • iJ„_i, we get that 

/i„fc„ ^ e Hn tl Hi - ■ ■ Hn-l = {1}, 

implying that hn = kn- We conclude the proof by iterating this process. 

□ 

Let us get back to the case of two groups. We have seen above that we can 
endow the cartesian product of two groups H and K with a group structure by 
considering componentwise binary operation 

(/ii, fci)(/i2, ^2) = {hih2, kik2) e H X K. 

The choice of this binary operation of course determines the structure of G = 
H X K, and in particular we have seen that the isomorphic copies of H and 
K in G are normal subgroups. Conversely in order to define an internal direct 
product, we need to assume that we have two normal subgroups. 

We now consider a more general setting, where the subgroup K does not 
have to be normal (and will not be in general), for which we need to define a 
new binary operation on the cartesian product H x K. This will lead us to the 
definition of internal and external semi-direct product. 

Recall that an automorphism of a group TJ is a bijective group homomor- 
phism from H to H. It is easy to see that the set of automorphisms of H forms a 
group with respect to the composition of maps and identity element the identity 
map Idn- We denote it by Aut(i7). 
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Proposition 1.20. Let H and K he groups, and let 

p : K ^ Aut(iJ), pk 

he a group homomorphism. Then the binary operation 

{H X K)x {H X K) {H X K), {{h, fc), {h' , k')) k> {hpk{h'), kk') 

endows H x K with a group structure, with identity element (1,1). 

Proof. First notice that the closure property is satisfied. 

(Identity). Let us show that (1, 1) is the identity element. We have 

(^fc)(l,l) = {hpk{l),k) - {h,k) 

for b\\ h ^ H, k ^ K, since pk is a group homomorphism. We also have 

{l,l){h',k') = {p^{h'),k')^{h',k') 

for all h' IE H , k' Cz K, since p being a group homomorphism, it maps Ij^ to 
lAut(_ft:) = Idff . 

(Inverse). Let {h,k) <E H x K and let us show that {p'f.^{h^'^),k^'^) is the 
inverse of [h, k). We have 

ih,k)ip-\h''),k-') = ihp,ip^\h-')),l) = {hh~\l) - (1,1). 

We also have 

{p-^\h-'),k~^){h,k) = {p-^\h~')pk-.{h),l) 
= (Pfe-i(/i-i)p,-i(/i),l) 

using that p^^ ~ pf^-i since p is a group homomorphism. Now 

iPk-^ih-')pk-i{h),l) = {pk-iih-'h),l) = (Pfc-i(l),l) = (1,1) 

using that pf^-i is a group homomorphism for all k K. 

Associativity. This is the last thing to check. On the one hand, we have 

[ih,k)ih',k')]ih",k") = {hpk{h'),kk')ih",k") 

= {hpk{h')pkk'{h"),{kk')k"), 

while on the other hand 

{h,k)[{h' ,k'){h" ,k")] = {h,k){h'pk'{h"),k'k") 

= {hpk{h'pk.{h")),k{k'k")). 

Since ii' is a group, we have {kk')k" ~ k{k'k"). We now look at the first 
component. Note that pkk' ~ Pk ° Pk' using that p is a group homomorphism, 
so that 

hpk{h')pkk'{h") = hpk{h')pk{pk'{h")). 
Furthermore, pk is a group homomorphism, yielding 

hpk{h')pk{pk'{h")) = hpk{h'pk'{h")) 

which concludes the proof. □ 
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We are now ready to define the first semi-direct product. 
Definition 1.17. Let H and K be two groups, and let 

p: K ^ Aut(i?) 

be a group homomorphism. The set H x K endowed with the binary operation 

{{h,k),{h',k'))^{hpk{h'),kk') 

is a group G cahed an external semi-direct product of H and K by p, denoted 
hy G ^ H XpK. 

Example 1.20. Let us consider the group Z2 of integers modulo 2. Suppose 
we want to compute the semi-direct product of Z2 with itself, then we need to 
first determine Aut(Z2). Since an automorphism of Z2 must send to 0, it has 
no other choice than send 1 to 1, and thus Aut(Z2) is only the identity map, 
in which case the semi-direct product is just the direct product, since pk is the 
identity for every k. To have a bigger automorphism group, let us consider 
H = Z3. In that case, apart the identity map, we also have the map x i— >■ 
that is M> 0, 1 H> 2, 2 H> 1. Thus p(0) = po is the identity, p{l) = pi is the 
inverse map, and we can form the external semi-direct product G ~ Xp Z2. 

In fact, this example holds for Z„, n > 3. 

Example 1.21. Let H = Z„ be the group of integers mod n, K — Z2 be the 
group of integers mod 2, and let p : K Aut(_ff) be the homomorphism that 
sends to the identity, and 1 to the inverse map of H , given by x 1— )■ x"^, which 
is indeed a group homomorphism of H since H is abelian. Since the subgroup 
of A\xt{H) generated by the inverse map is of order 2, it is isomorphic to K. 
We can thus define the external semi-direct product G ^ Zn x p Z2 . 

We can make observations similar to what we did for direct products. Namely, 
we can identify two isomorphic copies H and K of respectively H and K, given 

by 

H^{{h,lK), heH}, K^{{lH,k), keK}, 
and look at the properties of these subgroups. 

• The subgroup H = {{h, 1), h E H} is normal in H XpK, this can be seen 
by writing down the definition of normal subgroup. (We cannot claim the 
same for Kl). 

• We have HK = H Xp K, since every element {h,k) £ H Xp K can be 
written as {h, 1)(1, k) (indeed ih, 1)(1, k) = ihpi{l), k) = {h, k)). 

• We have i? n A' = {la}- 

This motivates the definition of internal semi-direct products. 
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Definition 1.18. Let G be a group with subgroups H and K. We say that G 
is the internal semi-direct product of H and if if is a normal subgroup of G, 
such that HK = G and H n K = {1g}- It is denoted by 

G = H -A K. 

Example 1.22. The dihedral group Z)„ is the group of all reflections and ro- 
tations of a regular polygon with n vertices centered at the origin. It has order 
2n. Let a be a rotation of angle 27r/n and let & be a reflection. We have that 

D.n^{a'b>, 0<z<n-l, .?=0,1}, 

with 

a" = 6^ ^ [haf = 1. 

We thus have that (a) = C„ and (&) ~ C2, where C„ denotes the cyclic group 
of order n. 

The geometric interpretation of Dn as symmetries of a regular polygon with 
n vertices holds for n > 3, however, note that when n = 2, we can still look 
at the relations defined above: we then have = = (ba)^ = 1, thus D2 
contains only 4 elements, the identity and 3 elements of order 2, showing that 
it is isomorphic to the Klein group C2 x C2. 

To prove, for n > 3, that 

Dn ~ C„ y> C2, 

we are left to check that (a) n (6) — {1} and that (a) is normal in £)„. The 
former can be seen geometrically (a reflection cannot be obtained by possibly 
successive rotations of angle 2tt /n, n > 3). For the latter, we first show that 

bab^^ e (a), 

which can be easily checked, since (ba)^ = baba = 1, thus bab = a^^ = bab^^ 
using that b^ = 1. This also shows that ba ~ a^^b from which we have: 

ba^b-^ = baab-^ = a-\bab-^) G (a), 

similarly 

ba^b-^ = baa'^b-^ = a'^iba^b-^) e (a). 

Again similarly to the case of direct products, these assumptions guarantee 
that we can write uniquely elements of the internal semi-direct product. Let us 
repeat things explicitly. 

Lemma 1.21. Let G be a group with subgroups H and K . Suppose that G ~ 
HK and H C] K ~ {1g}- Then every element g of G can be written uniquely in 
the form hk, for h € H and k G K . 



Proof. Since G = HK, we know that g can be written as hk. Suppose it can 
also be written as h'k'. Then hk = h'k' so h'^^h = k'k^^ e H fi K = {!}. 
Therefore h = h' and k = k' . □ 
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The internal and external direct products were two sides of the same objects, 
so are the internal and external semi-direct products, li G = H Xp K is the 
external semi-direct product of H and K, then H = H x {1} is a normal 
subgroup of G and it is clear that G is the internal semi-direct product of 
H X {1} and {1} x K. This reasoning allows us to go from external to internal 
semi-direct products. The result below goes in the other direction, from internal 
to external semi-direct products. 

Proposition 1.22. Suppose that G is a group with subgroups H and K , and 
G is the internal semi-direct product of H and K. Then G ^ H x p K where 
p : K Aut(iJ) is given by pk(h) — khk~^ , k € K, h G H. 

Proof. Note that pk belongs to Aut{H) since H is normal. 

By the above lemma, every element g of G can be written uniquely in the 
form hk, with h € H and k £ K . Therefore, the map 

ip:HXpK^G, ip{h,k)^hk 

is a bijcction. It only remains to show that this bijection is a homomorphism. 
Given {h, k) and (h' , k') in H Xp we have 

ip{{h, k)(h', k')) = ip{{hpk{h'),kk')) = ip{hkh'k-\kk') = hkh'k' = (^(/i, k)^{h\ k'). 

Therefore (p is a, group isomorphism, which concludes the proof. □ 

In words, we have that every internal semi-direct product is isomorphic to 
some external semi-direct product, where p is the conjugation. 

Example 1.23. Consider the dihedral group Dn from the previous example: 

Dn ^ C„ X 6*2. 

According to the above proposition, D„ is isomorphic to an external semi-direct 
product 

Dn — Cn X p C2 , 

where 

p : C2 ^ Aut(C„), 
maps to the conjugation in Aut(C„). We have explicitly that 

1 1-^ pi = Idc„, b 1-^ pb, pb{a) = bab^^ = a^^. 

In fact, we are back to Example 1.21! 

Before finishing this section, note the following distinction: the external 
(semi-)direct product of groups allows to construct new groups starting from 
different abstract groups, while the internal (semi-)direct product helps in ana- 
lyzing the structure of a given group. 
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\G\ G abclian G non-abelian 

1 {1} ^ 

2 C2 

3 C3 

4 C4, C2 X C2 

5 C5 

6 Ce = C3 X C2 D3 = C3 X C2 

7 C7 



8 Cg, Ci X C2, C2 X C2 X C2 Z?4 = X C2 



Table 1.2: C„ denotes the cyclic group of order n, D„ the dihedral group 

Example 1.24. Thanks to the new structures we have seen in this section, 
we can go on our investigation of groups of small orders. We can get two new 
groups of order 6 and 4 of order 8: 

• C3 X C2 is the direct product of C3 and C2. You may want to check that 
it is actually isomorphic to Cg. 

• The dihedral group 1)3 = C3 x C2 is the semi-direct product of C3 and 
C2. We get similarly D4 = C4 x C2. 

• The direct product C4 x C2 and the direct product of the Klein group 
C2 X C2 with C2. 

The table actually gives an exact classification of groups of small order (ex- 
cept the missing non-abelian quaternion group of order 8), though we have not 
proven it. The reason why the quaternion group of order 8 is missing is exactly 
because it cannot be written as a semi-direct product of smaller groups (see 
Exercises). 

1.7 Permutations and Group action 

Since we introduced the definition of group as a set with a binary operation 
which is closed, we have been computing things internally, that is inside a group 
structure. This was the case even when considering cartesian products of groups, 
where the first thing we did was to endow this set with a group structure. 

In this section, we wonder what happens if we have a group and a set, which 
may or may not have a group structure. We will define a group action, that is a 
way to do computations with two objects, one with a group law, not the other 
one. 

As a first result, we will prove the so-called Cayley's theorem, whose proof 
will motivate the introduction of group action. Since the statement of this theo- 
rem uses permutation groups, we start by recalling the necessary definitions. We 
will give enough background on permutations to define the alternating group, a 
group which is useful as an illustration of many concepts! 
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Definition 1.19. A permutation of a set S' is a bijection on S. The set of 
all such functions (with respect to function composition) is a group called the 
symmetric group on S. We denote by Sn the symmetric group on n elements. 

Example 1.25. Consider the symmetric group ^3 of permutations on 3 ele- 
ments. It is given by (note here that by ab we mean that we first apply the 
permutation b, then a) 



e 


123 - 


^ 123 


or 





a 


123 - 


> 213 


or 


(12) 


b 


123- 


^ 132 


or 


(23) 


ba 


123 - 


^ 312 


or 


(132) 


ab 


123- 


^ 231 


or 


(123) 


aba 


123 - 


^ 321 


or 


(13) 



One can check that this is indeed a group. The notation (132) means that the 
permutation sends 1 to 3, 3 to 2, and 2 to 1. 

We can generally write a permutation on m elements as (ii, . . . , im), which 
is called a cycle notation. The permutation (ii, . . . , im) is called an m-cycle. 
When m = 2, a 2-cycle is called a transposition. Note that several different 
cycles can represent the same permutation (e.g., (132) = (321) = (213)), and 
not every permutation is a cycle. For example, if we consider 1 1— > 2, 2 i— > 
1, 3 i-> 4, 4 1-^ 3, this permutation is not a cycle. However, it is clearly the 
product of two disjoint cycles, namely (12)(34). Formally, we say that two cycles 
(ii, . . .,is) and (ji, . . . , jt) are disjoint if and only if {ii, . . . ,«s}n {ji, . . . , jt} is 
empty. Such a decomposition of a permutation into product of disjoint cycles 
is true in general. 

Proposition 1.23. Every element of Sn can be expressed uniquely as a product 
of disjoint cycles, up to ordering of the cycles, and notational redundancy within 
each cycle. Furthermore, every cycle can be written as a product of transposi- 
tions. 

Proof. Let a be an element of Sn- Choose any index ii £ {l,...,7i}. By 
applying a repeatedly on ii, we construct a sequence of elements of {1, ... , n}: 
ii, 12, . . ., where ij ~ a{ij-i) for j > 2. If we let j grow, this sequence necessarily 
contains repetitions: suppose that the fcth term is the first one which is repeated, 
that is ife = ij with j < k. But this means that both ik-i and ij-i are mapped 
to ik by cr, and since cr is a bijection, two elements cannot be mapped to ik, 
thus it must be that ik = ii (the only element which has not yet a prcimage). 
We then see that a defines a k ~ 1-cyclc 

{ii, . . . ,ife_i). 

If fc — 1 = n, we are done. If not, we take i[ another index not covered by the 
first cycle, and iterate. This second cycle has to be disjoint from the first one, 
since cr is a bijection. We then obtain a disjoint cycle representation for cr. 
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Now take an Z-cycle (ii, . . . , i/). It can be rewritten as 
(ii, . ..,«/) = {ii i2){i2 is) • ■ ■ {k-i k)- 

Indeed, start with the right hand-side: ii and arc swapped. Thus (ii, . . . , ii) 
is mapped to an ^-tuplc whose last 2 terms are ii,ii-i. The next transposition 
is (*i-2j*!-i)j thus both terms are swapped, and we now have as last 3 terms 
i;, i/_2. The next swap will yield 

ii_2,«i-l,«i,«i-3 

and by iterating this process, we reach the last swap {ii, 12), that is 

12, is, ■ ■ ■ ,ii,h 

as we wanted to prove. □ 

The representation has a product of transpositions is not unique, for example 

(2,5,3,6) = (2,6)(2,3)(2,5) = (5, 2)(3, 5)(6, 3) = (1, 7)(2, 6)(2, 3)(2, 5)(1, 7). 

We can however define an invariant of a permutation, called the parity. 

Definition 1.20. An element of 5„ is said to be even if it can be expressed as 
a product of an even number of transpositions. It is said odd otherwise. 

For this definition to make sense, parity of an element of Sn should be unique, 
which it is. 

Theorem 1.24. For n > 2, every element of Sn has a unique parity, even or 
odd. 

Proof. To prove this, we need to introduce some ordering on the permutations. 
We call the switching number of a permutation a the number of ordered pairs 
with i < j but a{i) > (t(j). The switching number is an invariant of a 
permutation. Let t be the switching number of cr, and let r be an arbitrary 
transposition, say t ~ {i j). Without loss of generality, we may assume that i 
comes before j in the permutation 

cr(l), . . . ,cr(n). 

By applying r to cr, we switch i and j, and the picture now looks like 

(1,2,..., a-i(z), . . . , a-i(j), . . . , n) ^ (^(1), a(2), . . . , . . . , j, . . . , a{n)) 

A (a(l),a(2),...,r(i),...,T(j),...,(7(n)) 

(where the first vector is ordered, but not the second and the third). 

To understand the effect of the transposition t on the switching number of 
cr (that is we are computing the switching number of ra and sec how it differs 
from that of cr), we need to remember that we are looking at all the ordered 
pairs (fc, /), k < I, in (1,2,..., n): 
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1. For the ordered pair {a~^ (i) , <7~^ {j)) , when applying a, either (a) the 
ordering is preserved (i.e., i < j), the switching number thus does not 
change, however when applying r, the ordering is reversed, and thus t 
increases by 1, or (b) the ordering is changed, but t changes again the 
ordering, so that t decreases by 1. 

2. Let us now assume that i < j (if not do the same with j > i). Then 
for every index I such i < I < j, wc can look at the non-ordered pairs 

and It might be that <J~^{i) is cither greater or smaller than 

a~^{l), yielding one ordered pair or the other, and similarly for cr^^(j) and 
a~^{l). Thus each ordered pair might or not contribute to the switching 
number of cr, but after r is applied, i and j are reversed, and thus both 
{i,l) and {l,j) are changed at once. Thus the switching number increases 
by 2, decreases by 2, or does not change. We can write down the cases 
explicitly: 

(a-i(»),a"i(0),(a-i(0,a-i(j)) A A {j,i)Ai,^), i<i<j 

thus the switching number of cr is t including no switch for these 2 pairs, 
and that of rcr has two switches for these 2 paires, thus is of t + 2. 

and the switching number of a is here t including one switch for the 
second pair, and that of rcr has one switch for the first pair, but none for 
the second, thus a total of t. The case (cr"^(Z), cr~^(i)), {a~^{l),(j~^{j)) 
also gives t, and finally 

{a-\l),<J-^z)), (a-i(j),a"i(0) A A (»,/), t<l<j 

has a switching number of t for a including two switches for these 2 pairs, 
and rcr has no switch, thus a total of t — 2. 

3. All the non-ordered pairs (A:,/), where I < i < j and fc < Z or fc > /, or 
I > j > i and k < I or k > I (that is all the cases not conisdered so far) do 
not induce any change in the switching number, since by swapping i and 
j, we do not change the ordering of the pairs. 

This shows that given a permutation a with switching number t, composing with 
one transposition always changes the parity of the switching number. Since the 
switching number is invariant, this means that it always takes an even number of 
transpositions applied to cr to have a chance to keep the same switching number. 
This establishes that the parity of a permutation is either even or odd, but not 
both. □ 

Definition 1.21. The set of even permutations forms a subgroup of Sn called 
the alternating group, denoted by A„. 
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Note that if r is any odd permutation, then the coset tA„ consists entirely 
of odd permutations, and conversely, if a is an odd permutation, then T~^a is 
even, so cr G tA„. This shows that |A„| = |S'„|/2. 

We will encounter the alternating group in the future, but for now, we only 
need to recall the definition of the symmetric group Sn to prove Cayley's The- 
orem. 

Theorem 1.25. (Cayley's Theorem.) Every group is isomorphic to a group 
of permutations. 

Proof. Let Sg be the group of permutations of G. Wc will prove that every 
group is isomorphic to a subgroup of Sg- The idea is that each element g & G 
corresponds to a permutation of G, namely we need to define a map from G to 
Sg- 

\:G^Sg, g^ A(.g) = \g 
and since \g is a bijection on G, we need to define what Xg does: 

Ag : G -> G, Xg{x) = gx. 

For justifying that \g is indeed a bijection, it is enough to sec that g~^ exists 
since G is a group (try to write down the definition of injection and surjection). 

We are left to check that A is an injective group homomorphism. Injectivity 
again comes from G being a group, for if \g(x) = Xh{x) for all a; G G, then it 
has to be true that gx ~ hx when a; = 1. 

Now 

(A(a) o X{b)){x) = {Xa o Xb){x) = a{hx) = A„b(x) = X{ah){x) 
for all X, so that A(a) o A(6) = X{ah) which concludes the proof. □ 

Examples 1.26. 1. Consider the group {0,1} of integers modulo 2. The 
group element corresponds to the identity permutation, while the group 
element 1 corresponds to the permutation (12). 

2. Let us consider the group {0, 1,2} of integers modulo 3 to get a less simple 
example. Again corresponds to the identity permutation, 1 corresponds 
to the permutation (123), and 2 to the permutation (132). To see that it 
makes sense, you may want to check that the arithmetic works similarly 
on both sides. For example, we can say that 1 + 1 = 2 on the one hand, 
now on the other hand, we have (123) (123) = (132). 

3. One can check that the dihedral group of order 6 is isomorphic to 5*3 
(this can be done for example by working out the multiplication table for 
each group). 

The key point in the proof of Cayley's Theorem is the way the function Ag 
is defined. We see that for x £ G, g "acts" (via Xg) on x by multiplication. 
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Figure 1.2: Arthur Cayley (1821-1895): he was the first to define the eonecpt of 
a group in the modern way. Before him, groups referred to permutation groups. 

Definition 1.22. The group G acts on the set X if for all g E G, there is a 
map 

G X X ^ X, {g,x) 1-^ g ■ X 

such that 

1. h ■ {g ■ x) — (hg) ■ x for all g,h G G, for all a; G X. 

2. 1 • x = a; for all a; € X. 

The first condition says that we have two laws, the group law between ele- 
ments of the group, and the action of the group on the set, which are compatible. 

Examples 1.27. Let us consider two examples where a group G acts on itself. 

1. Every group acts on itself by left multiplication. This is called the regular 
action. 

2. Every group acts on itself by conjugation. Let us write this action as 

g ■x = gxg^'^. 

Let us check the action is actually well defined. First, we have that 
h- (g-x) = h- {gxg^'^) = hgxg^^h^^ {hg)xg~^h~'^ = {hg) ■ x. 
As for the identity, we get 

1 • a; = 1x1"^ = X. 



36 



CHAPTER 1. GROUP THEORY 



Similarly to the notion of kernel for a homomorphism, we can define the 
kernel of an action. 

Definition 1.23. The kernel of an action G x X X, {g,x) t-^ g ■ x is given 

by 

Ker = {g E G, g ■ x = x for all x}. 

This is the set of elements of G that fix everything in X. When the group G 
acts on itself, that is X ~ G and the action is the conjugation, we have 

Ker = {g G G, gxg^^ = x for all x} = {g G G, gx ~ xg for all x}. 

This is called the center of G, denoted by Z{G). 

Definition 1.24. Suppose that a group G acts on a set X. The orbit B{x) of 
x under the action of G is defined by 

B{x) = {g -x, g E G}. 

This means that wc fix an element x E X, and then we let g act on x when g 
runs through all the elements of G. By the definition of an action, g ■ x belongs 
to X, so the orbit gives a subset of X. 

It is important to notice that orbits partition X. Clearly, one has that 
X = \JxexB{x). But now, assume that one element x oi X belongs to two 
orbits B{y) and B{z), then it means that x = g ■ y = g' ■ which in turn 
implies, due to the fact that G is a group, that 

y = g^^g' ■ z, {g'Y^g ■ y- 

In words, that means that y belongs to the orbit of z, and vice- versa, z belongs to 
the orbit of j/, and thus B{y) = B{z). We can then pick a set of representatives 
for each orbit, and write that 

X = UB{x), 

where the disjoint union is taken over a set of representatives. 

Definition 1.25. Suppose that a group G acts on a set X. We say that the 
action is transitive, or that G acts transitively on X if there is only one orbit, 
namely, for all x,y £ X, there exists g € G such that g ■ x ~ y. 

Definition 1.26. The stabilizer of an element x £ X under the action of G is 
defined by 

Stab(x) = {g £ G, g ■ x ~ x}. 

Given x, the stabilizer Stab(a;) is the set of elements of G that leave x 
fixed. One may check that this is a subgroup of G. We have to check that if 
g, ft- G Stab(a;), then gh~^ £ Stab(x). Now 



[gh ^)-x = g-{h ■ x) 
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by definition of action. Since h € Stab(a::), we have h ■ x = x or equivalently 
X = ■ X, so that 

g ■ {h^^ ■ x) = g ■ X ~ X, 
which shows that Stab(2:) is a subgroup of G. 

Examples 1.28. 1. The regular action (see the previous example) is tran- 
sitive, and for all a; G X = G, we have Stab(x) = {!}, since x is invcrtible 
and we can multiply g ■ x — x hy x~^. 

2. Let us consider the action by conjugation, which is again an action of G 
on itself {X = G): g-x = gxg~^. The action has no reason to be transitive 
in general, and for all x Cz X = G, the orbit of x is given by 

Bix) ^ {gxg~\ g G G}. 

This is called the conjugacy class of x. Let us now consider the stabilizer 

of an element x Cz X: 

Stab(a;) = {g £ G, gxg~^ ^ x} = {g e G, gx = xg}, 
which is the centralizer of x, that we denote by Gq(x). 

Note that we can define similarly the centralizer Gq(S) where S is an arbi- 
trary subset of G as the set of elements of G which commute with everything in 
S. The two extreme cases are: if 5 = {x}, we get the centralizer of one element, 
if 5' = G, we get the center Z(G). 

Theorem 1.26. (The Orbit-Stabilizer Theorem). Suppose that a group G 
acts on a set X . Let B{x) he the orbit of x G X , and let Stab(a;) be the stabilizer 
of X. Then the size of the orbit is the index of the stabilizer, that is 

\B{x)\ = [G : Stab(a;)]. 

// G is finite, then 

\B{x)\ = |G|/|Stab(x)|. 
In particular, the size of an orbit divides the order of the group. 

Proof. Recall first that [G : Stab(x)] counts the number of left cosets of Stab(x) 
in G, that is the cardinality of 

G/Stab(a;) = {gSt&h{x), g £ G}. 

Note that cosets of Stab(a;) are well-defined since we saw that Stab(a;) is a 
subgroup of G. The idea of the proof is to build a function between the sets 
B{x) and G/Stab(x) which is a bijection. That the cardinalities are the same 
will then follow. 

Take y € B{x), that is y ^ g ■ x for some g € G. We define a map 



/ : B{x) G/Stab(a::), y = g ■ x^ g'&tah{x). 
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Before checking that this map is a bijection, we need to check that it is well 
defined. Indeed, for a given y, there is no reason for the choice of g to be unique 
(there is in general no bijection between G and B{x)). Suppose that 

y = gi ■ X ^ g2 ■ X 

then 

g2^gi ■ X = X .giStab(2;) = g2Stab(x). 

The equivalence is the characterization of having two equal cosets. This is 
exactly what we wanted: the image by / docs not depend on the choice of g, 
and if we choose two different g's, their image falls into the same coset. 
The surjectivity of / is immediate. 

We conclude the proof by showing the injectivity. Let us assume that fijji) = 
7(2/2) for yi= gi ■ X e B{x), y2 = g2 ■ x e B{x). Thus 

5iStab(a;) = 52Stab(a;) <=^ gj^^gi G Stab(x) <^=^ g2^gi-x = x <J=> grx = g2-x. 

□ 

Let G be a finite group. We consider again as action the conjugation {X — 
G), given by: g ■ x — gxg~^ . Recall that orbits under this action are given by 

B{x)^{gxg~\ g e G}. 

Let us notice that x always is in its orbit B{x) (take g — \). Thus if we have 
an orbit of size 1, this means that 

gxg^^ ~ X gx = xg 

and we get an element x in the center Z{G) of G. In words, elements that have 
an orbit of size 1 under the action by conjugation are elements of the center. 
Recall that the orbits B{x) partition X: 

X = UB{x) 

where the disjoint union is over a set of representatives. We get 

\G\ = E 1^(^)1 

= \Z{G)\+Y,\B{x)\ 

= |Z(G)| +^[G : Stab(a;)], 

where the second equality comes by splitting the sum between orbits with 1 
element and orbits with at least 2 elements, while the third follows from the 
Orbit-Stabilizer Theorem. By remembering that Stab(a:;) — Cg{x) when the 
action is the conjugation, we can alternatively write 

\G\^\Z{G)\+Y,[G:Cg{x)]. 

This formula is called the class equation. 
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Example 1.29. Consider the dihedral D4 of order 8, given by 

D4 = {1, s, r, r^,r'^, rs, r'^s, r'^s}, 

with = 1, = 1 and srs = r^^. We have that the center Z{D4) of D4 is 
{l,r^} (just check that r^s = sr^). There are three conjugacy classes given by 

{r,r^}, {rs,r^s},{s,r^s}. 

Thus 

\D4\=8=\Z{D4)\ + \B{r)\ + \Birs)\ + \Bis)\. 

The following result has many names: Burnside's lemma, Burnside's count- 
ing theorem, the Cauchy-Frobenius lemma or the orbit-counting theorem. This 
result is not due to Burnside himself, who only quoted it. It is attributed to 
Frobenius. 

Theorem 1.27. (Orbit-Counting Theorem). Let the finite group G act on 
the finite set X , and denote by the set of elements of X that are fixed by g, 
that is X^ = {x € X, g ■ x = x}. Then 

number of orbits = —- \X^\, 
G ' ' 
' ' geG 

that is the number of orbits is the average number of points left fixed by elements 
ofG. 

Proof. We have 

Y.\X^ = \{{g,x)eGxX, g-x = x}\ 
geG 

= ^|Stab(x)| 
= ^|G|/|i?(x)| 

by the Orbit-Stabilizer Theorem. We go on: 

J2\G\/\Bix)\ = \G\Y,l/\B{x)\ 

Be set of orbits xeB ' ' 

= \G\ E 1 

Be set of orbits 

which concludes the proof. Note that the second equality comes from the fact 
that we can write X as a disjoint union of orbits. □ 
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1.8 The Sylow theorems 

Wc look at orders of groups again, but this time paying attention to the occur- 
rence of prime factors. More precisely, we will fix a given prime p, look at the 
partial factorization of the group order n as n ~ p^m where p does not divide 
TO, and study the existence of subgroups of order p or a power of p. In a sense, 
this is trying to establish some kind of converse for Lagrange's Theorem. Recall 
that Lagrange's Theorem tells that the order of a subgroup divides the order of 
the group. Here we conversely pick a divisor of the order of the group, and we 
try to find a subgroup with order the chosen divisor. 

Definition 1.27. Let p be a prime. The group G is said to be a p-group if the 
order of each element of G is a power of p. 

Examples 1.30. We have already encountered several 2-groups. 

1. We have seen in Example 1.15 that the cyclic group C4 has elements of 
order 1,2 and 4, while the direct product C2 x C2 has elements of order 1 
and 2. 

2. The dihedral group D4 is also a 2-group. 

Definition 1.28. If \G\ = p^m, where p does not divide to, then a subgroup P 
of order p*" is called a Sylow p-subgroup of G. Thus P is a p-subgroup of G of 
maximum possible size. 

The first thing we need to check is that such a subgroup of order p^ indeed 
exists, which is not obvious. This will be the content of the first Sylow theorem. 
Once we have proven the existence of a subgroup of order p^, it has to be a 
p-group, since by Lagrange's Theorem the order of each element must divide p^. 

We need a preliminary lemma. 

Lemma 1.28. If n ^ p^m where p is prime, then (^) = to mod p. Thus if p 
does not divide to, then p does not divide [pr) ■ 

Proof. We have to prove that 

^ = TO mod p, 

after which we have that if p does not divide to, the to ^ mod p implying 
that (^t) ^ mod p and thus p does not divide (^J-) . 

Let us use the binomial expansion of the following polynomial 

P'' / r\ 

{x + lf" ^Y^r^jxP^-'^l'' = xP'' + 1 modp 

where we noted that all binomial coefficients but the first and the last are 
divisible by p. Thus 

{x + 1)P^"' = {xP" + 1)" mod p 
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which we can expand again into 

E (";">'■•"-' -!:(':)<-'•)•"-' -i!'^ 

We now look at the coefBcicnt of x'^ on both sides: 

• on the left, take k ~ p^{m — 1), to get (^p™), 

• on the right, take fc = m — 1, to get {^^i) = rn. 

The result follows by identifying the coefficients of a;^ . □ 

We are ready to prove the first Sylow Theorem. 

Theorem 1.29. (1st Sylow Theorem). Let G be a finite group of order p'^m, 
p a prime such that p does not divide m, and r some positive integer. Then G 
has at least one Sylow p-suhgroup. 

Proof. The idea of the proof is to actually exhibit a subgroup of G of order p"^ . 
For that, we need to define a clever action of G on a carefully chosen set X. 
Take the set 

X = {subsets of G of size p^} 

and for action that G acts on X by left multiplication. This is clearly a well- 
defined action. We have that 

which is not divisible by p (by the previous lemma). Recall that the action of 
G on X induces a partition of X into orbits: 

X = UB{S) 
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where the disjoint union is taken over a set of representatives. Be careful that 
here 5" is an element of X, that is S* is a subset of size . We get 

1^1 = E 1^(^)1 

and since p does not divide \X\, it docs not divide J2 \^{'^)\^ meaning that there 
is at least one S for which p docs not divide |i?(S')|. Let us pick this S, and 
denote by P its stabilizer. 

The subgroup P which is thus by choice the stabilizer of the subset S G X 
of size p^ whose orbit size is not divisible by p is our candidate: we will prove 
it has order p^ . 

\P\ > p'' . Let us use the Orbit-Stabilizer Theorem, which tells us that 

\B{S)\^\G\/\P\^fm/\P\. 

By choice of the S we picked, p does not divide \B{S)\^ that is p does 
not divide p'^m/\P\ and |P| has to be a multiple of p^ , or equivalently p^ 
divides \P\. 

\P\ < p^. Let us define the map A^,, x G 5, by 

Xx : P ->■ S, \x{g) = gx. 

In words, this map goes from P, which is a subgroup of G, to S, which is 
an element of X, that is a subset of G with cardinality p"^ . Note that this 
map is well-defined since gx € S for any x € S and any g G P by definition 
of P being the stabilizer of S. It is also clearly injective {gx = hx implies 
g = h since x is an element of the group G and thus is invertible) . If we 
have an injection from P to 5, that means |P| < 15*1 = p^- 

□ 

Example 1.31. Consider the general group G = Gi„(Fp) of n x n invertible 
matrices with coefficients in Fp, which denotes integers mod p, p & prime. Let 
us compute a Sylow p-subgroup of G. For that, we first need to know the 
cardinality of G. This is a classical combinatorial computation: to build an 
invertible matrix with coefficients in Fp, the first column can be anything but 
the whole zero vector, thus — 1 choices, the 2nd column can be anything 
but a multiple of the first column, thus p" — p choices, the 3rd column can be 
anything but a linear combination of the first 2 columns, thusp"— choices,..., 
thus the cardinality is 

|G| - |GL„(Fp)| = 

- (p" - - l)p2(p"-^ - 1) • • 1) 
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Figure 1.4: Augustin Louis Cauchy (1789-1857) 



Clearly p"("~i)/2 jg highest power of p that divides G, we thus have to find a 
subgroup of G of that order. Consider the set U{p, n) of n x n upper triangular 
matrices with every diagonal coefficient at 1 and elements of Fp above. This is 
clearly a subgroup of G, since such matrices are invertible and form a group. 
Its cardinality is ^"("^i)/^ as wanted (there are n{n — l)/2 coefficients above 
the diagonal, which can take any value mod p). 

Corollary 1.30. (Cauchy Theorem). If the prime p divides the order of G, 
then G has an element of order p. 

Proof. Let P be a Sylow p-subgroup of G (which exists by the 1st Sylow Theo- 
rem), and pick a; 7^ 1 in P. The order \x\ of x is a power of p by definition of a 
p-group, say \x\ ^ p^ . Then x^' has order p. □ 

The above corollary gives some converse to Lagrange's Theorem. The one 
below gives an alternative definition of a finite p-group. It is tempting to use it 
as a definition of p-group, however it cannot be used for infinite groups. 

Corollary 1.31. A finite group G is a p- group if and only if the order of G is 
a power of p. 

Proof. If the order of G is not a power of p, then it is divisible by some other 
prime q, in which case G contains an element of order q by Cauchy's Theorem, 
which contradicts the definition of p-group. 

The converse is clear using Lagrange's Theorem. □ 

Example 1.32. Let us consider again the group H = U{p,n) of n x n upper 
triangular matrices with every diagonal coefficient at 1 and elements of Fp above 
computed in Example 1.31. We know that \H\ = p"("~i)/2^ thus it is ap-group. 
Now let us consider the union 

y U{p,n). 

n>l 
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This forms an infinite p-group, since every element has order a power of p. 

Now that we know that at least one Sylow p-subgroup exists, let us derive a 
result on the number Up of Sylow p-subgroups in a group G. 
We need again a preliminary lemma. 

Lemma 1.32. Let H and K be arbitrary finite subgroups of a group G. Then 

\HK\ l^ll^l 



\Hr\K\ 



Note that if K is further assumed to be normal in G, then this result can be 
deduced from the second isomorphism Theorem. 

Proof. Consider the map 

f : H X K ^ HK, {h, k) ^ hk. 

Since / is surjective, \HK\ < \H x K\ < oo since H and K are finite, and thus 
HK is finite. Let /iifci, . . . , hdk^ be the distinct elements of HK. Then H x K 
is the disjoint union of the f~^(hiki), i = \, . . . ,d. Now we can check that 

f-\hk) = {{hg,g~^k), geHHK} 

and this set has cardinality 

\f-\hk)\ = \HnK\. 

Thus 

\H X K\= d\Hr\K\ 
which concludes the proof. □ 

Theorem 1.33. (2nd Sylow Theorem). Let G be a finite group of order 
p^m, p a prime such that p does not divide m, and r some positive integer. 
Denote by Up the number of Sylow p-subgroups of G. Then 

Up = 1 mod p. 

Proof. Consider the set 

X = { all Sylow p — subgroups of G } 

whose cardinality \X\ is denoted by Up. By the 1st Sylow Theorem, this set is 
non-empty and there exists at least one Sylow p-subgroup P in X, whose order 
is p''. We can thus let P act on X by conjugation, i.e., g ■ Q ~ gQg~^, g G P, 
Q G X. Note that in the case where P is the only Sylow p-subgroup. then we 
can take Q = P. 

By the Orbit-Stabilizer Theorem, we have that the orbit B{Q) of any Sylow 
p-subgroup Q in X has cardinality 



\B{Q)\ = |F|/|Stab(g)| =p7|Stab(Q)|. 
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In particular, the size of an orbit of any Sylow p-subgroups divides , meaning 
it has to be either 1 or a power of p. 

Let us recah that the set X is partitioned by the orbits B{Q) under the 
action of P, so that the cardinahty of X is: 

\x\ = = E + E 

where Q' and Q" denote subgroups whose orbit has respectively one element or 
at least two elements. Since p divides the second sum, we have 

\X\ = number of orbits of size 1 mod p. 

To conclude the proof, we thus have to show that there is only one Sylow p- 
subgroup whose orbit has size 1, namely P itself (it is clear that P has an orbit 
of size 1, since conjugating P by itself will not give another subgroup). 

Let us assume there is another Sylow p-subgroup Q whose orbit has only 
one element, namely (recall that one element is always in its orbit): 

gQg-^ ==Q, ge P, 

which translates into 



gQ = Qg for all geP ^ PQ = QP. 
This easily implies that PQ is a subgroup of G, and by the previous lemma 

p^p^ 



\PQ\ 



\PnQ\ 



implying that \PQ\ is a power of p, say for some c which cannot be bigger 
than r, since \G\ ~ p^m. Thus 

P"- ^\P\<\PQ\<P' 

so that \PQ\ ~ p^ and thus \P\ = \PQ\, saying that Q is included in P. But 
both Q and P have same cardinality oi p^\ so Q = P. □ 

The third of the Sylow Theorems tells us that all Sylow p-subgroups are 
conjugate. 

Theorem 1.34. (3rd Sylow Theorem). Let G he a finite group of order 
p^m, p a prime such that p does not divide m, and r some positive integer. 
Then all Sylow p-subgroups are conjugate. 

Proof. Let P be a Sylow p-subgroup of G and let i? be a p-group of G. We will 
prove that R (being a p-group in general) is contained in a conjugate of P. 
Let R act by multiplication on the set Y of left cosets of P: 



Y = {gP, g e G}. 
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It is a well-defined action (it is multiplication in the group G). 

We want to prove that there is an orbit of size 1 under this action. By 
Lagrange's Theorem, we know that 

|F| = |G|/|PH^^m 

and thus p docs not divide |y| by assumption on m. By writing that we have a 
partition of Y by its orbits, we get 

\Y\ = J2\B{y)\ 

and there exists one orbit B{y) whose size is not divisible by p. By the Orbit- 
Stabilizer Theorem, we have that the size of every orbit divides which has 
order a power of p (by a corollary of the 1st Sylow Theorem), so every orbit size 
must divide p, which gives as only possibility that there is an orbit of size 1. 
Let gP S y be the element whose orbit size is 1. We have 

h- gP = gP 

for h E R, since gP belongs to its orbit. Thus 

g-'hg eP ^ he gPg-' 

for all h in R. We have just proved that the p-group R is contained in a conjugate 
of P. 

All we needed for the proof is that i? is a p-group, so the same proof holds 
for the case of a Sylow p-subgroup, for which wc get that R is contained in a 
conjugate of P, and both have same cardinality, which concludes the proof. 

Wc will use the fact that the proof works for R a p-group in general for 
proving one corollary. □ 

Corollary 1.35. 1. Every p-subgroup of G is contained in a Sylow p-subgroup. 

2. The number rip of Sylow p-subgroups divides m. 

Proof. 1. Now we know that if P is a Sylow p-subgroup, then so is gPg~^, 
5 e G, by the above theorem. The proof of the theorem itself shows that 
any p-group is included in gPg'^^ and we are done. 

2. Let the group G act by conjugation on the set of its subgroups. In partic- 
ular, G acts on the Sylow p-subgroup P, and the orbit of P has size the 
number of Sylow p-subgroups in G, denoted by np. By the Orbit-Stabilizer 
Theorem, rip divides |G| = p^m. But p cannot be a prime factor of rip 
since rip = \ mod p, from which it follows that rip must divide m. 

□ 
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1.9 Simple groups 

We will now see a few applications of the Sylow Theorems, in particular to 
determine the structure of so-called simple groups. 

Definition 1.29. A group G is simple if G 7^ {1} and the only normal subgroups 
of G are G itself and {1}. 

Finite simple groups are important because in a sense they are building 
blocks of all finite groups, similarly to the way prime numbers are building 
blocks of the integers. This will be made clearer in the coming section by 
the Jordan-Holder Theorem. Infinite simple groups exist and can be found for 
example among Lie groups, but we will concentrate here on finite groups. 

The case of simple abelian groups is easy to understand. Suppose that G 
is a simple abelian group. Note that G cannot be {1}. Now G being abelian, 
all its subgroups are normal. On the other hand, G being simple, its only 
normal subgroups are {1} and itself, leaving as only solution that G has only 
two subgroups, namely {1} and G. Thus G has to be a cyclic group of prime 
order. 

We now start looking at non-abelian simple groups. We start with some 
preliminary results. 

Proposition 1.36. If P is a non-trivial finite p-group, then P has a non-trivial 
center. 

Proof. Let P act on itself by conjugation. The orbits of this action are the 
conjugacy classes of P, and we have that x belongs to an orbit of size 1 if and 
only if X belongs to the center Z{P). 

By the Orbit-Stabilizer, the size of any orbit must divide |P|, which is a 
power of p by a corollary of the 1st Sylow Theorem. 

If it were true that the center is trivial, that is Z{P) = {1}, then that means 
there is only one orbit of size 1, and thus all the other orbits must have size 
that divides p, namely they are congruent to mod p. Thus 

|P| = |Z(P)|+^|i?|^l modp, 

where the sum is over orbits of size at least 2. This is clearly a contradiction, 
which concludes the proof. □ 

Lemma 1.37. The group P is a normal Sylow p-suhgroup of a group G if and 
only if P is the unique Sylow p- subgroup ofG. 

Proof. We know from the 3rd Sylow Theorem that the Sylow p-subgroups form 
a single conjugacy class. Then P is the unique Sylow p-subgroup means that P 
is the only element in the conjugacy class, and thus it satisfies 

gPg'^ = P. 

for every g € G, which exactly means that P is a normal subgroup of G. 
Conversely, if P is normal, then gPg^^ = P for all g € G, which means that 
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when we conjugate P by any element of G, P is the only Sylow p-subgroup in its 
conjugacy class. Since all the Sylow p-subgroups belong to the same conjugacy 
class, P is the only Sylow p-subgroup. □ 

Thanks to the two above results, we can now prove that a non-abclian simple 
group must have more than one Sylow p-subgroup. 

Proposition 1.38. Let G be a finite group which is non-ahelian and simple. 
If the prime p divides \G\, then the number Up of Sylow p-subgroups is strictly 
bigger than 1. 

Proof. Let us look at the prime factors appearing in the order of G. 

• If p is the only prime factor of |G|, then \G\ must be a power of p, that 
is G is a non-trivial p-group (it is non-trivial by definition of simple and 
a p- group by a corollary of the 1st Sylow Theorem). Now the above 
proposition tells us that its center Z(G) is non-trivial as well. Since Z{G) 
is a normal subgroup of G and G is simple, it must be that G = Z{G), 
which contradicts the assumption that G is non-abelian. 

• We then know that |G| is divisible by at least two distinct primes. So if 
P is a Sylow p-subgroup, then 

{1} < P < G, 

where the second inclusion is strict since the order of G is divisible by two 
primes. 

If there were only one Sylow p-subgroup, namely np — 1, then this Sylow 
p-subgroup would be normal by the above lemma, which contradicts the 
simplicity of G. 

□ 

Let us see if we can be more precise by refining the assumptions on the order 
of the group G we consider. The group G can be either abelian or non-abelian, 
though the results on simplicity are more interesting for non-abelian groups. 

Proposition 1.39. Let G be a group of order pq, where p and q are distinct 
primes. 

1. If q =^ 1 mod p, then G has a normal Sylow p-subgroup. 

2. If both q ^ 1 mod p and p ^ 1 mod q, then G is cyclic. 

3. G is not simple. 

Proof. 1. By Lemma 1.37, saying that G has a normal Sylow p-subgroup is 
the same as saying that there is a unique Sylow p-subgroup. This is now 
indeed the case, since the number Up of Sylow p-subgroups has to satisfy 
both Up = I mod p and Tip | g by the Sylow Theorems. Since q is prime. 
Up is either 1 or q. It cannot be that Up = q, since it would imply that 
(J = 1 mod p which contradicts the assumption. 
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2. By the previous point, the group G has a normal Sylow p-subgroup P 
and a normal Sylow g-subgroup Q, both of them cyclic (since they are of 
prime order) . Let us write them respectively P = (x) and Q ~ (y). Since 
both P and Q are normal, with P n Q = {!}, we have that xy = yx (we 
have seen that before, but the argument goes like that: take the element 
xyx~^y~^ and show that it belongs to P H Q by normality of P and Q). 
Thanks to this commutativity property, we have that {xy)" = x"y" and 
the order of xy is pq, showing that G is cyclic with generator xy. 

3. Without loss of generality, we can assume that p > q so that p docs not 
divide q ~ 1 which can be rewritten as 

q ^ 1 mod p. 

By the first point, we know that G has a normal Sylow p-group, and thus 
G cannot be simple. 

□ 

Here is another family of groups which are not simple. The proof contains 
an interesting combinatorial argument! 

Proposition 1.40. Let G be a group of order \G\ = p^q where p and q are two 

distinct primes. Then G contains either a normal Sylow p-subgroup or a normal 
Sylow q-subgroup. In particular, G is not simple. 

Proof. Recall that having a normal Sylow p-subgroup (resp. q-subgroup) is the 
same as saying there is a unique Sylow p-subgroup (resp. g-subgroup). Suppose 
that the claim is not true, that is both the number of Sylow p-subgroups Hp and 
the number of Sylow g-subgroups Uq are bigger than 1. Let us start this proof 
by counting the number of elements of order q in G. 

If a Sylow g-subgroup has order q, it is cyclic and can be generated by any 
of its elements which is not 1. This gives q — 1 elements of order q per Sylow 
g-subgroup of G. Conversely, if y has order g, then the cyclic group it generates 
is a Sylow g-subgroup, and any two distinct Sylow g-subgroups have trivial 
intersection. Thus 

number of elements of order q ~ nq{q — 1). 

Now we know from the Sylow Theorems that Uq | p^, thus Uq is either p of 
p^ {uq = 1 is ruled out by the fact that we do a proof by contradiction). 

• ^q — p^: then the number of elements of order NOT g is 

p^q-p^iq- 1) =p^ 

On the other hand, if P is a Sylow p-subgroup, then it also contains p^ 
elements, and all of them have order not g, so that we can conclude that 
P actually contains all elements of order not g, which implies that we have 
only one Sylow p-subgroup, yielding the wanted contradiction. 
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\G\ 


abelian 


non-abelian 


P 


Cp simple 


not possible 




not simple 


not simple since ^(G) > 1 


pq 


not simple 


not simple 


p^q 


not simple 


not simple 



Table 1.3: Cp refers to a cyclic group of prime order. 



• Ug = p : We know from Sylow Theorems that 

Hq = I mod q ^ p = 1 mod q ^ p > q, 

but also that 

Up I q 

and since q is prime, that leaves Up = I or Up = q and thus Up = q. As 
before 

Up = I mod p ^ q = I mod p ^ q > p. 
This concludes the proof. 

□ 

We have thus shown that the situation is easy for simple abelian groups. 
For non-abelian groups, we have seen two cases (|G| = pq and \G\ ~ p^q) where 
groups are not simple. To find a non-abelian group which is simple, one has to 
go to groups of order at least 60. Indeed, it has been proven that the smallest 
non-abelian simple group is the alternating group of order 60, this is the 
group of even permutations of a finite set. This result is attributed to Galois 
(1831). It is not an easy task to determine the list of simple groups, and in fact, 
the classification of finite simple groups was only accomplished in 1982 (there 
has been some controversy as to whether the proof is correct, given its length - 
tens of thousands of pages - and complexity). 

1.10 The Jordan-Holder Theorem 

We have mentioned when introducing simple groups in the previous section that 
they can be seen as building blocks for decomposing arbitrary groups. This will 
be made precise in this section. 

Definition 1.30. Let G be a group, and let Go, . . . , G,i be subgroups of G such 
that 

1. G„ = {1} and Go = G, 

2. G,+l<G„^ = 0,...,n-l. 
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Then the series 

{1} = G„ < G„_i < • • • < Go = G 
is called a subnormal series for G. 

Suppose that G^+i is not a maximal normal subgroup of Gi, then we can 
refine the subnormal series by inserting a group H such that G^+i <\ H <l Gi, 
and we can repeat this process hoping it will terminate (it will if G is finite, it 
may not otherwise). 

Definition 1.31. Let G be a group, and let Go, . . . , G„ be subgroups of G such 
that 

1. G„ = {1} and Go = G, 

2. Gi+i < Gi, i = 0, . . . , n — 1, such that G^+i is a maximal normal subgroup 
of G,. 

Then the series 

{1} = G„ < G„_i <•••<] Go = G 

is called a composition scries for G. The factor groups G^/Gi+i are called the 
factors of the composition series, whose length is n. 

Another way of stating the condition G^+i is a maximal normal subgroup of 
Gj is to say that Gi/G^+i is simple, i = 0, . . . , n — 1. To see that asks a little 
bit of work. This result is sometimes called the 4rth isomorphism theorem. 

Theorem 1.41. (Correspondence Theorem). Let N be a normal subgroup 
of G and let H be a subgroup of G containing N. Then the map 

ip : {subgroups of G containing N} — >■ {subgroups ofG/N}, H i— > ipiH) ~ H/N 

is a bijection. Furthermore, H is a normal subgroup of G if and only if H/N is 
a normal subgroup ofG/N. 

Proof. We first prove that -0 is a bijection. 

Injectivity. If Hi/N = H2/N, then cosets in each subgroup are the same, 
that is for any hi € Hi, we have hiN = h2N for some /12 G H2, implying that 
h2^hi G N C H2 and thus hi € H2, showing that Hi C H2. By repeating the 
same argument but reverting the role of Hi and H2, we get H2 C Hi and thus 
Hi — H2. 

Surjectivity. Let Q be a subgroup of G/N and let tt : G — >■ G/N be the 
canonical projection. Then 

7r-i(Q) = {a e G, aN e Q}. 

This is a subgroup of G containing N and 

^{tt-\Q)) = {aN, aNeQ} = Q. 
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We are left to prove that H <\G <^=> 
H <G. For any a G G, we have to show that 

{aN){H/N){aN)-^ 



H/N < G/N. Assume thus that 



H/N. 



Now for any hN £ H/N, we have 



{aN){hN){aN) 



-1 



{aha 



e H/N 



and we are done. 

Conversely, suppose that H/N < G/N . Consider the homomorphism 



which is the composition of the canonical projection tt of G onto G/N, and the 
canonical projection of G/N onto (G/N) /{H/N) (the latter makes sense since 
H/N < G/N). We now want to show that H is the kernel of this map, which 
will conclude the proof since the kernel of a group homomorphism is normal. 

An element a is in the kernel if and only if {aN){H/N) = H/N , that is if 
and only if aN G H/N , or equivalently aN = hN for some h £ H . Since N 
is contained in H, this means aN is in H and thus so is a, which is what we 
wanted to prove. □ 

Let us now go back to the composition series of G. If G/N is simple, then 
by definition it has only trivial normal subgroups, namely N and G/N . Now 
using the Correspondence Theorem, the normal subgroups N and G/N exactly 
correspond to the normal subgroups N and G in G, which shows that N is the 
maximal normal subgroup of G. 

The Jordan-Holder Theorem will tell us that if G has a composition series, 
then the resulting composition length n and the simple composition factors 
Gi/Gi+i are unique up to isomorphism and rearrangement. This for example 
shows that if Gi and G2 are two groups with different composition factors, then 
they cannot be isomorphic. 

Lemma 1.42. Let G be a group with composition series 



Then for any normal subgroup K of G, if we remove the duplicates from the 
series 



the result is a composition series for K of length at most n. 

Proof. We need to show that K n G^+i < K n Gi and that the group {K n 
Gi)/{K n Gi+i) is simple for all i. 

Let X £ K C\ Gi and g £ K f] G^+i. Then xgx~^ £ K since by assumption 
if is a normal subgroup of G, and xgx^^ G G^+i since Gi+i < Gi. Thus 
xgx~^ £ K n Gi+i which proves that K n G^+i <\ K f] Gi. 



a ^ {aN){H/N) 



{1} = Gn < Gn-i < • • • < Go = G. 



{1} = K n Gn <Kn G„_i <---<KnGo^K, 
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We now look at the quotient group {K n Gi)/{K n G^+i). Since d/Gi+i 
is simple, CJi+i is a maximal normal subgroup of G,;, and thus the only normal 
subgroups of Gi that contain Gi+i are Gi and Gi+i. 

Recall that KC\Gi is normal in Gi (it is the kernel of the canonical projection 
oi G to G/K restricted to Gi), so that we get 

G,+i < (XnG,)G,+i <G,. 

For the first normal inclusion, compute that for kg G {K n Gi)Gi+i we have 

kgGi+ig^^k^^ = kGi+ik~^ C G^+i 

since k £ Gi and G^+i is normal in Gi. For the second normal inclusion, we 
have for g £ Gi that 

g{K n G,)G,+ig-' = (A' n GO^G.+ig^^ 

since K D Gi is normal in Gi and 

(A- n G,)gG,+ig-' C (A' n G,)G,+i 

since G^+i < G^. 

Thus either G^+i = (iiTn Gi)Gi+i or {K n Gi)Gi+i = Gi. Using the second 
isomorphism theorem (with Gj+i < G^ and {K O Gi) < Gi), we have 

{K n G,OG,+i/G,+i ~ (/V n G,)/(/v n G, n G,+i) = (A' n GO/(a- n G,+i). 

We can see that if Gi+i = (A' n Gi)Gi+i, then K n G; = ATn G^+i and we have 
a duplicate to remove. If (AT n Gi)Gi+i = Gi, then 

G,/G,+i ^ (A'nGO/(if nG,+i) 

and thus (AT n Gi)/(A' n G^+i) is simple. □ 

Theorem 1.43. (Jordan-Holder Theorem). Let G be a group that has a 
composition series. Then any two composition series for G have the same length. 
Moreover, if 

{1} = G„ < G„_i < • • • < Go = G 

and 

{1} =Hn<: Hn-i < • • • < i/o = G 
are two composition series for G, there exists a permutation r such that Gi/Gi^i ~ 

Ht(i)/ Hr{i) + l- 

Proof. The proof will be on induction on the length of a composition series. 
Suppose that G is a group with a composition series of length 1. Then the 
subnormal series 

Gt>{l} 
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Figure 1.5: Camille Jordan (1838-1922) and Otto Holder (1859-1937) 



cannot be refined, so it must be a composition series. In particular G ~ G/{1} 
is simple. This is also the only composition series for G and so all the assertions 
are true for length 1. 

Suppose now that n > 1 and that the claims are true for composition series 
of length up till n — 1. Let G be a group with composition series of length n, 
say 

{1} = G„ <G„-i <•••<] Go = G 
(so that Gi Gi+i for each i). Now let 

{1} = H,n < H,n-i <Ha = G 

be a composition series for G (again Hi ^ i/i+i for each i). 

We first have to show that m = n after which we discuss the unicity of the 
decomposition. 

(Proof that m = n). The idea of the proof goes as follows: to use the 
induction hypothesis, we need to get a composition scries of length smaller than 
n, that is, we need to identify the first composition factors, which we will using 
the above lemma. Concretely, we first exclude the case when Gi = iJi, then 
compute a composition scries of length n — 2 for Hi H Gi, which will indeed be 
the second composition factor. We then use the second composition scries of G 
to get another composition series for Hi n Gi whose length depends on jti, that 
we can compare to the known one. 

If Gi = Hi , then by the induction hypothesis applied to Gi , we have n — 1 = 
m — 1, we have a suitable permutation r of the n — 1 factors, and we are done. 

Suppose then that Hi ^ Gi. Since both Gi and Hi are maximal normal in 
G, we see that Hi <] GiHi <\ G with Hi ^ GiHi since we assumed Hi ^ Gi. 
Thus GiHi = G, from which we conclude by the 2nd isomorphism theorem that 



GiHi/Hi ~ G/Hi ~ Gi/{Hi n Gi). 
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Since G/Hi is simple, we get that Gi/{Hi n Gi) is simple as well. Now by the 
above lemma, upon removing duplicates from the series 

{1} = i7i n G„ < • • • < i7i n Go = 

we get a composition scries for Hi of length at most n and thus upon removing 
duplicates 

{1} = HiHGn^--- <Hir\Gi 

is a composition series for Hi n Gi of length at most n — 1. Since Gi/{Hi n Gi) 
is simple, it follows that upon removing duplicates 

{1} = i7i n G„ < • • • < i/i n Gi < Gi 

is a composition series for Gi. But then 

Gi > G2 O • • • O G„ = {1} 

and 

Gi > Hi n Gi ^ iJi n G2 > • • • > ffi n G„ = {1} 

are both composition series for Gi, with the first series of length n — 1. By 
induction hypothesis, both series have the same length. Since Gi ^ Hi O Gi 
(recall that we assumed Hi ^ Gi), any duplication must occur later in the 
series. Let 

Gi = i^i [> K2 = i?i n Gi o A'3 [> • • • o Kn = {1} 

denote the composition series for Gi of length 71 — 1 that results from remov- 
ing the duplicates. By hypothesis, there exists a permutation a such that 
Gi/Gi+i ~ Ka(i) / Kci(i)^i for each i = 1, ... ,n — 1. Set a not to move the 
index 0, then 

G = Go c> Gi o G2 c> • • • c> G„ = {1} 

and 

G = A'o > Gi = Ki O A'2 = i^i n Gi > A3 l> • • • A„ = {1} 

are composition series of length n for G and a is a permutation such that 
Gi/Gi^i ~ Ka{i)/Ka{i)+i for each i = 0, . . . , n — 1. Moreover, we have found a 
composition series for Hi n Gi of length n ~ 2. 

Let us now repeat similar computations for the composition series 

G = Ho>Hil>---l>H„,^{l} 

and the normal subgroup Gi of G. Again by the above lemma, upon removing 
the duplicates from the series 

Gi = i/o n Gi ^ iJi n Gi > • • • > i?„i n Gi {1} 

we obtain a composition series for Gi, so that upon removing the duplicates 
iJinGi>--->i?,„nGi = {1} 



56 



CHAPTER 1. GROUP THEORY 



yields a composition series for HiDGi. Now since Hi n Gi has a composition 
series of length n — 2, namely 

= ffi n Gi c> • • • O Kn = {1}, 

we apply the induction hypothesis to Hi D Gi to conclude that all composition 
series of HiHGi have length n— 2, and so in particular the preceding composition 
series 

HinGi^--->H„^nGi = {1} 

has length n — 2. Wc cannot conclude yet, since we do not know how many 
terms there are in function of m in the above composition series (we need to get 
rid of the duplicates). 

Since wc know from the 2nd isomorphism theorem that Hi /{Hi H Gi) ~ 
HiGi/Gi = Gq/Gi, which is a simple group, it follows that Hi /{Hi n Gi) is 
simple. Thus upon the removal of the duplicates from 

i/i O ifi n Gi > • • • > i?,„ n Gi = {1} 

the result is a composition series for Hi of length n — 1 (we added the term Hi 
to the composition series for Hi D Gi of length n — 2). Also 

Hii>H2i>--->H„, = {1} 

is another composition series for Hi. Since the first series has length n — 1, by 
our induction hypothesis, the second series must also have length n — 1. Since 
its length is m — 1, it follows that m = n. 

(Unicity of the composition factors). Again by induction hypothesis 
on Hi , we have a permutation /? of the n — 1 composition factors (which can 
be extended to n factors by setting /3(0) = 0.) Namely, let Li, i = 1,2, ... ,n 
denote the distinct terms in the series 

iJi o iJi n Gi o iJa n Gi >•••[> i7„ n Gi = {1} 

so that Li = Hi and L2 = Hi O Gi. Then wc have composition series 

G = Hq \> Hi \> ■ ■ ■ Hn = {1} and G = io O Li > • • • L„ = {1} 

of length n for G and there exists a permutation /3of{0,l,...,n — 1} such that 
Hi/Hi+i ~ X/3(i)/i^(i)+i for each i = 0, 1, . . . , n - 1. 

We are almost done but for the fact that we need an isomorphism between 
Hi/Hi+i and G^(i)/G^(j)+i instead of having Hi/Hi+i ~ L^(i)/L^(j)+i . We re- 
call that we already have a permutation a such that Gi/Gi+i ~ J'^a(j)/-^a(i)+i- 
We are thus left to find one between Li/Li+i and Ki/Kij^i. 

Finally, since K2 = L2 = Hi D Gi, we have two composition series for G: 

Gt> Git> i^iHGiO A^o ••• A'„_i[> A:„ = {1} 
Gl> Hit> iJiHGiO LaO ••• i„-il> L„ = {1}. 
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We may apply the induction hypothesis to Hi n Gi to obtain the existence 
of a permutation 7 of {2, 3, . . . , n — 1} such that for each i in this set we have 
Ki/Ki+i ~ We have aheady seen that G/Gi ~ Hi/{Hir\Gi) and 

G/Hi ~ Gi/ (HiDGi), so we may extend 7 to a permutation of {0, 1, ... , 71 — 1} 
by setting 7(0) ~ 1 and 7(1) = 0. Then since 

Ka = G = Lo, A'l = Gi, = Hi, K2 = = H d, 

we have 

Ki/Ki+i ~ L^(i)/L^(j)+i, z = 0, . . . , n - 1. 
In summary, we have m = n, and for r = /3^^ja, we have 

Gi/Gi+i ~ Hr{i)/Hr{i)+i, i — 0, . . . ,n ~ 1. 

This concludes the proof. □ 

Example 1.33. The cyclic group C12 has three composition series 

Gi < G2 < Ge < C12, Ci < C2 < C4 < C12, Ci < C3 < Ce < C12 

and all of them have the same length. Furthermore, the factor groups appearing 
are 

{C2, C3, C2}, {C2, C2, C3}, {C3, C2, C2} 
which are indeed the same up to permutation. 

Corollary 1.44. (Fundamental Theorem of arithmetic). Let n > 1 be a 

positive integer. Then there exist unique primes Pi < P2 < ' ' ' < Pk o,nd unique 
positive integers ri , . . . , such that n ~ PYP2' ' ' ' Pk'' ■ 

Proof. Let G = (g) be a cyclic group of order n. Then every subgroup of G is 
normal, and there is a unique subgroup of size d for each positive divisor d of 
n. Let d be the largest proper divisor of n, and let Gi be the unique subgroup 
of G of size d. Then G/Gi is simple and cyclic, hence of prime order. We may 
repeat this construction on the cyclic subgroup Gi, so by induction, we obtain 
a composition scries 

G = Go < Gi < G2 < • • • < G,n = {1} 

for G with Gi/Gi^i of prime order pi for each i. Thus 

n = \G\ 

- \G/Gi\\Gi\ 

~ IG/G1IIG1/G2I • • • |Gm_i /G„||G 

m I 

= PlP2- ■ -Pm-l- 

The uniqueness of the prime decomposition of n follows from the Jordan-Holder 
Theorem applied to G. □ 
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1.11 Solvable and nilpotent groups 

Let us start by introducing a notion stronger than normality. 

Definition 1.32. A subgroup H of the group G is called characteristic in G if 
for each automorphism / of G, we have 

f{H) = H. 

We may write H char G. 

This is stronger than normal since normality corresponds to choose for / the 
conjugation by an element of g. 

Note that / restricted to H a, characteristic subgroup (denoted by /|_f/) 
is an automorphism of H (it is an endomorphism by definition of H being 
characteristic) . 

Here are a few immediate properties of characteristic subgroups. 
Lemma 1.45. Let G he a group, and let H , K be subgroups of G. 

1. If H is characteristic in K and K is characteristic in G, then H is char- 
acteristic in G (being characteristic is transitive). 

2. If H is characteristic in K , and K is normal in G, then H is normal in 
G. 

Proof. 1. Note that by assumption H < K < G. Let (j) be an automorphism 
of G. Since K is characteristic in G, then 4>{K) = Khy definition, and thus 
01 A' is an automorphism of K. Now (I)\k{H) = H since H is characteristic 
in K. But (f)\K is just the restriction of (recall H < K), so 4'{H) = H. 

2. Consider the automorphism of K given by fc i— gkg~^, g £ G, which is 
well defined since K is normal in G. For any choice of g, we get a different 
automorphism of K, which will always preserve H since H is characteristic 
in K, and thus gHg^^ C H which proves that H is normal in G. 

□ 

Let us introduce a new definition, that will give us an example of character- 
istic subgroup. 

Definition 1.33. The commutator subgroup G' of a group G is the subgroup 
generated by all commutators 

[x,y] = xyx^^y^'^. 
It is also called the derived subgroup of G. 
Let us make a few remarks. 

• By the subgroup generated by all commutators, we mean that by definition 
we take the smallest subgroup of G containing all the commutators. It is 
thus closed by construction. 
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• The product of two or more commutators need not be a commutator. It 
is known that the least order of a finite group for which there exists two 
commutators whose product is not a commutator is 96. 

• Note that the inverse [x,y]^^ of [x,y] is given by [x, j/]^^ = bji^] = 
yxy^^x^^ . 

Here are a hst of properties of the commutator subgroup G". 
Lemma 1.46. Let G' he the commutator subgroup of G. 

1. G' is characteristic in G. 

2. G is abelian if and only if G' is trivial. 

3. G/G' is abelian. 

4. If N is normal in G, then G/N is abelian if and only if G' < N . 

Proof. 1. To show that G" is characteristic in G, we have to show that 
f{G') = G' for / any automorphism of G. Now 

f{[x,y]) = fixyx-'y-') = fix)f{y)f{xr'fiy)-' = [fix),f{y)]. 

2. We have that G" is trivial if and only if xyx^^y^^ = 1 which exactly means 
that xy = yx. 

3. Since G" is characteristic, it is also normal in G, and G/G' is a group. 
We are left to prove it is an abelian group. Take two elements (that is 
two coscts) G'x and G'y in G/G'. We have that G"a;G"y = G'yG'x ^ 
G'xy = G'yx by definition of the law group on G/G' . Now 

G'xy = G'yx <^=^ xy{yx)~^ e G" ■^=^ xyx'^y^^ e G", 

which holds by definition. 

4. Let us assume that is normal in G. We have that G/N is a group, and 
G/N is abelian if and only if for Nx, Ny two cosets we have 

NxNy = NyNx Nxy = Nyx xy{yx)~^ G ■<=J> xyx~^y^^ £ 

which exactly tells that each commutator must be in TV. 

□ 

We can iterate the process of taking commutators: 

G(") = G, G(i) = G', G(2) = (G')', . . . , G('+i) = (G^)', . . . 
The process may or may not reach {1}. 
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Definition 1.34. The group G is said to be solvable if G'^''^ = 1 for some r. 
We then have a normal series 

{1} = G^'') < G('-i) < • • • < G(°' = G 

called the derived scries of G. The term "solvable" historieally refers to Galois 
theory and the question of "solvability" of quintic equations, as we will sec later. 

We have already seen the notion of subnormal series in the previous section. 
By normal scries, wc mean a scric where not only each group is normal in its 
successor, but also each group is normal in the whole group, namely each G^'^ 
is normal in G. We have indeed such series here using the fact that the commu- 
tator subgroup is a characteristic subgroup, which is furthermore a transitivity 
property. 

Let us make a few remarks about the definition of solvable group. 
Lemma 1.47. 1. Every abelian group is solvable. 

2. A group G both simple and solvable is cyclic of prime order. 

3. A non- abelian simple group G cannot be solvable. 

Proof. 1. We know that G is abelian if and only if G' is trivial. We thus get 
the normal series 

G'"' =G[>G(i' 

2. If G is simple, then its only normal subgroups arc {1} and G. Since G' is 
characteristic and thus normal, wc have either G' = {1} or G' = G. The 
latter cannot possibly happen, since then the derived serie cannot reach 
{1} which contradicts the fact that G is solvable. Thus we must have that 
G' = {1}, which means that G is abelian. We conclude by remembering 
that an abelian simple group must be cyclic of order a prime p. 

3. If G is non-abclian, then G' cannot be trivial, thus since G is simple, its 
only normal subgroups can be either {1} or {G}, thus G' must be either 
one of the other, and it cannot be {1}, so it must be G. Thus the derived 
series never reaches {1} and G cannot be solvable. 

□ 

There are several ways to define solvability. 
Proposition 1.48. The following conditions are equivalent. 

1. G is solvable, that is, it has a derived series 

{1} = G^'') < G(''-i) < • • • < G(") = G. 

2. G has a normal series 

{1} = G^<Gr_i <---<Go = G 
where all factors, that is all quotient groups Gi/Gi^i are abelian. 
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3. G has a subnormal series 

{1} = Gr< Gr-l < • • • < Go = G 

where all factors, that is all quotient groups Gi/Gi^i are abelian. 

Proof. That 1. ^ 2. is clear from Lemma 1.46 where we proved that G/G' is 
abehan, where G' is the commutator subgroup of G. 

That 2. 3. is also clear since the notion of normal series is stronger than 
subnormal series. 

What we need to prove is thus that 3. =J> 1. Starting from G, we can always 
compute G', then G(2),....To prove that G has a derived series, we need to 
check that G'-"^ = {1} for some s. Suppose thus that G has a subnormal series 

l^Gr^ Gr-l < • • • < Go = G 

where all quotient groups Gi/Gi+i are abelian. For i = 0, we get Gi < G and 
G/Gi is abelian. By Lemma 1.46, we know that G/Gi is abelian is equivalent 
to G' < Gi. By induction, let us assume that G^*^ < G^, that is taking i 
times the derived subgroup of G is a subgroup which is contained in the ith 
term G,; of the subnormal series, and see what happens with G^'^^^K We have 
that G('+i) = (G('))' < G- by induction hypothesis (and noting that if i7 C G 
then H' C G', since all the commutators in H surely belong to those of G). 
Furthermore, GJ < G^+i since Gi/Gi+i is abelian. Thus G'""' < G^ = {1}. □ 

Let us see what are the properties of subgroups and quotients of solvable 
groups. 

Proposition 1.49. Subgroups and quotients of a solvable group are solvable. 

Proof. Let us first consider subgroups of a solvable groups. If 7J is a subgroup 
of a solvable group G, then H is solvable because ij'^^ < G'-*-' for all i, and 
in particular for r such that ij'-'''' < G^*"-* = {1} which proves that the derived 
series of H terminates. 

Now consider TV a normal subgroup of a solvable group G. The commutators 
of G/N are cosets of the form xNyNx~^Ny~^N = xyx~^y~^ N , so that the 
commutator subgroup {G/N)' of G/N satisfies {G/N)' = G'N/N (we cannot 
write G' /N since there is no reason for to be a subgroup of G'). Induc- 
tively, we have {G/N)^^ = G^''>N/N. Since G is solvable, G^''^ = {1} and thus 
(G/iV)('') = N/N = {1} which shows that G/N is solvable. □ 

Example 1.34. Consider the symmetric group S4. It has a subnormal series 

{1} < G2 X G2 < ^4 < S4, 

where A4 is the alternating group of order 12 (given by the even permutations 
of 4 elements) and C2 x C2 is the Klein group of order 4 (corresponding to the 
permutations 1, (12)(34), (13)(24), (14)(23)). The quotient groups are 

G2 X G2/{1} ~ G2 X G2 abelian of order 4 
A4/G2 X G2 — G3 abelian of order 3 
S4/A4 ~ G2 abelian of order 2. 
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We finish by introducing the notion of a nilpotent group. We will skip 
the general definition, and consider only finite nilpotent groups, for which the 
following characterization is available. 

Proposition 1.50. The following statements are equivalent. 

1. G is the direct product of its Sylow subgroups. 

2. Every Sylow subgroup of G is normal. 

Proof. If G is the direct product of its Sylow subgroups, that every Sylow sub- 
group of G is normal is immediate since the factors of a direct product are 
normal subgroups. 

Assume that every Sylow subgroup of G is normal, then by Lemma 1.37, 
we know that every normal Sylow p-subgroup is unique, thus there is a unique 
Sylow Pi-subgroup Pi for each prime divisor pi of \G\, i = 1, . . . , fc. Now by 
Lemma 1.32, we have that |Pii-2| = I 1 1 ^2 1 since Pi n P2 = and thus 
I-Pl • • • -Pfcl = |Pi| • • • \Pk\ = |G| by definition of Sylow subgroups. Since we work 
with finite groups, we deduce that G is indeed the direct product of its Sylow 
subgroups, having that G = Pi ■ ■ ■ Pk and Pi n Ylj^i Pj is trivial. □ 

Definition 1.35. A finite group G which is the product of its Sylow subgroups, 
or cquivalently by the above proposition satisfies that each of its Sylow subgroup 
is normal is called a nilpotent group. 

Corollary 1.51. Every finite abelian group and every finite p- group is nilpotent. 

Proof. A finite abelian group surely has the property that each of its Sylow 
subgroup is normal, so it is nilpotent. 

Now consider P a finite p-group. Then by definition P has only one Sylow 
subgroup, namely itself, so it is the direct product if its Sylow subgroups and 
thus is nilpotent. □ 

Finite nilpotent groups are also nicely described with respect to their nor- 
malizer. 

Proposition 1.52. If G is a finite nilpotent group, then no proper subgroup H 
of G is equal to its normalizer Ng{H) = {g G G, gH = Hg}. 

Proof. Let H be a proper subgroup of G, and let n be the largest index such 
that Gn C H (such index exists since G is nilpotent). There exists a G Gn+i 
such that a ^ G (since H is a. proper subgroup). Now for every h E H, the 
cosets aGn and /iG„ commute (since Gn+i/Gn ^ Z{G/Gn)), namely: 



G„a/i = {Gna){Gnh) = {Gnh){Gna) = G„/ia 
and thus there is some h' £ G„ C H for which 



ah 



h'ha 



that is 



aha 



-1 



h'h e H. 



Thus a £ NciH) and a<^H. 



□ 
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Definition 1.36. A central scries for a group G is a normal series 



{1} = G„ < G„_i < • • • < Go = G 



such that Gi/Gi+i C Z(G/Gi+i) for every i ~ 0, . . . ,n — 1. An arbitrary group 
G is said to be nilpotent if it has a central series. The smallest n such that G 
has a central series of length n is called the nilpotency class of G, and G is said 
to be nilpotent of class n. 



Example 1.35. Abelian groups arc nilpotent of class 1, since 



{l} = Gi<Go = G 



is a normal series for G and for i = we have G/{1}~GC Z{G). 



Nilpotent groups in general arc discussed with solvable groups since they 
can be described with normal series, and one can prove that they are solvable. 
Indeed, if G^/Gi+i C Z(G/Gi+i), then the elements of Gi/Gi^i commute with 
each other, since they commute with everything in G/Gi+i, thus Gi/Gi+i is 
abelian. It is not true that solvable groups are necessarily nilpotent (see Exer- 
cises for an example). 
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The main definitions and results of this chapter are 

• (1.1-1.2). Definitions of: group, subgroup, group 
homomorphism, order of a group, order of an element, 
cyclic group. 

• (1.3-1.4). Lagrange's Theorem. Definitions of: 
coset, normal subgroup, quotient group 

• (1.5). 1st, 2nd and 3rd Isomorphism Theorems. 

• (1.6). Definitions of: external (semi-)direct product, 
internal (semi-)direct product. 

• (1.7). Caylcy's Theorem, the Orbit-Stabilizer The- 
orem, the Orbit-Counting Theorem. Definitions of: 
symmetric group, group action, orbit, transitive ac- 
tion, stabilizer, centralizer. That the orbits partition 
the set under the action of a group 

• (1.8). Definition: p-group, Sylow p-subgroup. The 3 
Sylow Theorems. Cauchy Theorem 

• (1.9). Definition: simple group. Applications of the 
Sylow Theorems. 

• (1.10). Definitions: subnormal series, composition 
scries. Jordan-Holder Theorem. 

• (1.11). Definitions: characteristic subgroup, commu- 
tator subgroup, normal and derived series, solvable 
group, finite nilpotent group. 



Chapter 

Exercises on Group Theory 



Exercises marked by (*) are considered difficult. 

2.1 Groups and subgroups 

Exercise 1. Let G be a group and let 7? be a nonempty subset of G. We have 
seen that the two following statements are equivalent: 

a) H is a. subgroup of G, 

b) bi) x,y e H ^ xy e H 
62) X e H ^ x-^ (z H. 

1. Show that 61) is not sufficient to show that is a subgroup of G. 

2. Show that however, if G is a finite group, then bi) is sufficient. 
Answer. 

1. Consider for example the group G = Q* with multiplication. Then the 
set Z with multiplication satisfies that if a;,y G Z then xy e Z. However, 
Z is not a group with respect to multiplication since 2 G Z but 1 /2 is not 
in Z. 

2. Let X G H. Then take the powers x,x^,x^,... of x. Since G is finite, 
there is some n such that = 1, and by &i), a;" £ H thus 1 E H, and 

Exercise 2. Let G be a finite group of order n such that all its non-trivial 
elements have order 2. 

1. Show that G is abelian. 
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2. Let H he a subgroup of G, and let 5 € G but not in H. Show that HUgH 
is a subgroup of G. 

3. Show that the subgroup H U gH has order twice the order of H. 

4. Deduce from the previous steps that the order of G is a power of 2. 
Answer. 

1. Let a;,y G G, x,y not 1. By assumption, — = 1, which also means 
that x,y and xy are their own inverse. Now 

{xy){xy) xy = [xyy^ = y^^x^^ = yx. 

2. First note that H U gH contains 1 since 1 E H. Let x,y E HU gH. Then 
X E H or X Cz gH, and y €z H or y Cz gH. If both x,y E H, then clearly 
xy G H since H is a subgroup. If both x,y £ gH, then a; = g/i, y = gh' 
and = ghgh' = /i/i' € since G is commutative and g^ = 1. If say 
X G H and y G glf (same proof vice- versa), then xy ~ xgh ~ g{xh) e gH 
since G is commutative. For the inverse, ii x G H, then x^^ E H since iJ 
is a subgroup. If x £ gH, then a; = g/i, and a;^^ — h^^g^^ — gh since G 
is commutative and all elements have order 2. 

3. It is enough to show that the intersection of H and gH is empty. Let 
X E H and x E gH. Then x = gh for h E H , so that xh = gh^ = g, which 
is a contradiction, since xh E H and g is not in H by assumption. 

4. Take h an element of order 2 in G, and take H — {l,h}. li G ^ H we 
are done. If not, there is a g not in H, and by the previous point H U gH 
has order 4. We can now iterate. If G = iJ U gH we are done. Otherwise, 
H U gH = if' is a subgroup of G, and there exists a g' not in so that 
H'Ug'H' has order 8. One can also write a nice formal proof by induction. 

Exercise 3. Let G be an abelian group, and let x,y E G oi finite order. Show 
that \xy\ divides lcm{\x\, \y\), where Icm stands for "least common multiple". 
Give an example to illustrate that \xy\ 7^ lcm{\x\, \y\) in general. 

Answer. Let x € G be of order n and let y € G be of order m. Since G is an 
abelian group, we have that 

(xy)'' = x'^y'' 

for any k. Thus by definition, the order \xy\ is the smallest positive k such that 
j-kyk _ 2 Also by definition, the lcm{\x\, \y\) = lcm{n,m) = N satisfies that 
N = nn' = mm! , so that 

x^y^ (.T")"'(y'")'"' 1. 

Finally [xy)^ = {xy)'^ = 1 and since k is the smallest such positive integer with 
this property, it must divide N. (If you are not yet convinced, you can add that 
k is smaller than N, thus you can divide iV by fc and write N = kq + r, r < k, 
which implies that 1 = (xy)'"^^^ = {xyY , a contradiction to the definition of k.) 
An easy counter-example is: take y = x^^, x ^ 1, then |1| 7^ /cm(|a;|, |a;^^|). 
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Exercise 4. Let G be a group and let H and K be two subgroups of G. 

1. Is H (1 K a, subgroup of G? If your answer is yes, prove it. If your answer 
is no, provide a counterexample. 

2. Is H U K a, subgroup of G? If your answer is yes, prove it. If your answer 
is no, provide a counterexample. 

Answer. 

1. This is true. It is enough to check that xy~^ G H O K ioi x,y E H O K. 
But since x.,y G H, we have xy~^ £ H since _ff is a subgroup, and likewise, 
xy~^ G K for x,y G K since if is a subgroup. 

2. This is false. For example, take the groups of integers modulo 3 and 2, 
namely Z/3Z and Z/2Z. Then 2 and 3 are in their union, but 5 is not. 

Exercise 5. Show that if G has only one element of order 2, then this element is 
in the center of G (that is the elements of G which commute with every element 
in G). 

Answer. Let x be the element of order 2. Then yxy~^ has also order 2. Thus 
it must be either 1 or x. If yxy~^ = 1, then a; = 1 a contradiction. Thus 

yxy^^ = X. 

Exercise 6. Let G be a group and iJ be a subgroup of G. Show that 
NGiH)^{gGG, gH^Hg} 

and 

C'ciH) = {g gG, gh = hg for aU h G H} 
are subgroups of G. 

Answer. Take x,y G Ng{H). We have to check that xy^^ G Ng{H), that is, 
that xy^^H = Hxy^^. But Hxy^^ = xHy^^ since x G Ng{H), and xHy^^ = 
xy~^H since yH = Hy <^=^ y~^H = Hy^^. 

Now take x,y G Cg{H). We have to check that xy~^h = hxy^^ for all 
h G H. But hxy^^ = xhy~^ because x G Gg{H), and xhy~^ — xy^^h since 
yh = hy y^^h = hy^^. 

2.2 Cyclic groups 

Exercise 7. Let G = Z24 be the group of invertible elements in Z24. Find all 
cyclic subgroups of G. 

Answer. We have that the size of G is 

|G| = ip{2A) = ip{3)ip(2^) = 2-4 = 8. 
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G is given by the elements that are invertible mod 24, that is, those that are 
coprime to 24: 

G = {1,5,7, 11,13, 17,19,23}. 

Now 

(I) = {1}, 

(5) = {5,52 = 1}, 

(7) = {7,72 = 49 = 1}, 

(II) = {11, 11^ = 121 = 1} 
(13) = {13,132 = 169 = 1} 
(17) = {17,172 = (-7)2 = 1} 
(19) ={19, 192 = (-5)2 = 1} 
(23) = {23,232 = (-1)2 = 1 

and there are 8 cychc subgroups of G, including the trivial subgroup {1}. 

Exercise 8. Let G = Zjq be the group of invertible elements in Z2o- Find two 
subgroups of order 4 in G, one that is cyclic and one that is not cyclic. 

Answer. As in the exercise above, G contains 

\G\ = ip{2Q) = ip{'i)ip{^) = 2-4 = 8. 
These 8 elements are coprime to 20, that is 

G = {1,3,7,9,11,13,17,19}. 

The subgroup 

(3) = {3, 32 = 9, 3-'' = 7, 3"^ = 21 = 1} 
is cyclic of order 4. We have that 

11, ll2 = 121 = 1, 19, 192 = (-1)2 = 1, 11 • 19 = (-11) = 9, 92 = 81 = 1 

and 

{1,11,19,9} 
is a group of order 4 which is not cyclic. 

Exercise 9. Let Lp be the Euler totient function. Let G be a cyclic group of 
order n. 

1. First show that the order of is 

l/l = n/gcd{k,n). 

2. Show that if m\n. then (g"^™) is the unique subgroup of G of order m. 

3. Prove that for every factor m of n, the number of elements in G with order 
m is exactly ip{m). 
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4. Furthermore, show that J2m\n '/'(™) = 

Answer. Let G ~ {g) be a cychc group of order n, so that every element in G 
is of the form 

g'', l<k<n. 

1. Set m = gcd{k,Ti), so that k = mk' , n = mn' . If (5'^)'^ = 1, then n\kr, 
and 

n kr 
m m 

By definition of m, n/m and k/m arc coprime, so that n/m divides r. 
Hence n/m is the smallest power of g^ such that = 1 showing 

that n/m ~ \g''\. 

2. Let iJ be a subgroup of order to, then H = {g^) with = m and some 

> 0. We will show first that H can be also generated by an element g'' 
where d = gcd{k, n), and in particular, we can always write 

F= (/), d\n. 

Since d\k, k = dq and g*"' = £ (g'^) and (g'^n) C (5^). Conversely, 
d = gcd{k, n) = kr + ns for some r, s and 

gd ^ gkr + nS ^ gkr ^ l^gk^ 

and {g'^) C {g^). Now to = \H\ = \g'^\ = \g''-\ = n/gcd{d,n) by the above, 
and since d\n, we get that to = n/d, or d = m/n. 

3. Now for TO|n, an element is of order m if and only if it is the generator of 
the only subgroup of G of order to. Now there are as many generators for 
this subgroup as elements coprime to to, that is ip{m). 

4. To show that J2m\n = "-i can sort the elements of G according 
to their order. Since the order of each element divides n, we have 

n = nb of elements of order to = Lpim). 

rn\n m\n 

1.3 Cosets and Lagrange's Theorem 

Exercise 10. Let G = 6*3 be the group of permutations of 3 elements, that is 

G = {(1),(12),(13),(23),(123),(132)} 
and let H = {(1), (12)} be a subgroup. Compute the left and right cosets of H. 
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Answer. We have 



9 


9H 


Hg 


(1) 


{(1),(12)} 


{(1),(12)} 


(12) 


{(1),(12)} 


{(1),(12)} 


(13) 


{(13), (123)} 


{(13), (132)} 


(23) 


{(23), (132)} 


{(23), (123)} 


(123) 


{(13), (123)} 


{(23), (123)} 


(132) 


{(23), (132)} 


{(13), (132)} 



For example, H{23) is {(1)(23), (12)(23)}. Clearly (1)(23) = (23). Now (12)(23) 
sends 123 i— )■ 132 via (23), and then sends 132 M- 231 via (12), so that finally 
we have 123 M- 231 which can be written (123). 

Exercise 11. Let G be a finite group and let H and K be subgroups with 
relatively prime order. Then H D K = {1}. 

Answer. Since H D K is a subgroup of both H and we have 

\HC^K\ I \Hl \HC^K\ \ \K\ 

by Lagrange's Theorem. Since {\H\, \K\) = 1, it must be that \H n K\ = 1 
implying that H O K = {1}. 

2.3 Normal subgroups and quotient group 

Exercise 12. Consider the following two sets: 

^={(0 . a-cGM*, 5eM|, [/= ?)'^^^ 

1. Show that T is a subgroup of GL2{M.). 

2. Show that [/ is a normal subgroup of T. 
Answer. 

1. It is enough to show that if X,Y e T, then XY^^ e T. Let 

a b\ _ fa' b' 



then 



^ - '0 cj' ^ - \0 c' 



1 I a h\^(d -b'\ _ \_ fac' -ab' + a'b\ 
~ [0 c a'cJ [0 a' j a'c' I a'c ^ 
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2. We have to show that XYX'^ e U when Y € U and X € T. We have 

a' a'b + b'\ J_ (d -b' 

c' ) a'd \Q a' 

1 (a'd -b'a' + a'{a'b + b') 
a'd 



e U. 



Exercise 13. Let G be a group, and let be a subgroup of mdcx 2. Show 
that H is normal in G. 



Answer. If H is of index 2. that means by definition that there are only 2 
cosets, say H and giH for some gi not in H. Note that ii gi ^ g2 & G are not 
in i?,then (7i(;2 G 7?. Indeed, we have that either gig2 G H ot gig2 G giH (recall 
that the cosets partition the group), and gig2 £ giH is not possible since g2 is 
not in H. In other words, if both (?2 are not in iJ, then (3152)^^(31.92)"^ G 

Now let h G H, g G G. If g G H, then ghg~^ G H and we are done. If 
g is not in H, then g/i is not in H and by the above remark we have that 
ghg^^ = {gh)g^^ G H (take gi = .9/1, .92 = .9^^)- Alternatively by the same 
above remark, since (gig2)H{gig2)^^ G H for every 51, 52 not in iJ, it is enough 
to wrote g as gig2, say gi = g {g is not in _ff) and 52 = 3^^/* (which is not in 
H either). 

Exercise 14. If Gi is normal in G2 and G2 is normal in G3, then Gi is normal 
in G3. True or false? 



Answer. This is wrong (we need the notion of characteristic to get transitivity, 
this is introduced in the section " Solvable and Nilpotent Groups" ) . An example 
is the dihedral group D4: 

D4^{rJ\f = l,r' = l,fr = r-'f). 

The subgroup 

H - (r/, /r) = {1, r/, r^Jr} ~ G2 x G2 
is isomorphic to the Klein group. We have that H <\G. Finally 

K^{rf)={l,rf}<H 

but K is not normal in G, since f ■ rf ■ f^^ = f ' i" f ' f = ff which is not in K. 

Exercise 15. Let G be a group and let Z{G) be its center (that is the elements 
of G which commute with every element in G). Show that if G/Z{G) is cyclic 
then G is abelian. Give an example to show that if G/Z{G) is only abelian, 
then G does not have to be abelian. 
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Answer. If G/Z{G) is cyclic, then G/Z{G) = {gZ{G)). Let x,y e G, then 
their corresponding cosets are xZ{G),yZ{G) which can be written 

xZ{G) = {gZiG})" = .g'Z(G), yZ{G) = {gZ{G))' = g'Z{G) 

and 

x = g^zi, y = g'z2, Zi,Z2 & Z{G). 

Now 

xy = g''zig'z2 = yx 

since zi,Z2 € Z{G). For example, consider the dihedral group D4 — {r, f\f'^ — 
l,r'* = 1,/r = r-i/>- Its center is Z{D4) = {l,r^}. Thus D^/ZiD^) is a 
group of order 4, it contains 4 cosets: Z{D4),rZ{D4), fZ{D4),rfZ{D4), which 
is isomorphic to the Klein group, which is abelian but not cyclic. 

Exercise 16. 1. Let G be a group. Show that if is a normal subgroup of 
order 2, then H belongs to the center of G. 

2. Let G be a group of order 10 with a normal subgroup H or order 2. Prove 
that G is abelian. 
Answer. 

1. Since H is of order 2, then H = {l,h}. It is furthermore normal, so that 
gHg^^ = {1, g/ig^^} is in H, thus ghg^^ = h and we are done, since this 
is saying that h commutes with every g d G. 

2. Since H is normal in G, G/H has a group structure, and \G/H\ = 
\G\/\H\ = 10/2 = 5. Thus the quotient group G/H is a group of or- 
der 5, implying that it is cyclic. Now take x, y in G, with respective coset 
xH , yH . Since the quotient group is cyclic, there exists a coset gH such 
that xH = {gH)'^ ~ g^H, and yH ~ {gH)^ = g^H for some fc, L Thus 
X = g'^h, y = g^h' for some /i, h' e H. We are left to check that xy = yx, 
that is g^hg^h' = g^h'g^h, which is true since we know that h,h' G H 
which is contained in the center of G (by the part above) . 

2.4 The isomorphism theorems 

Exercise 17. Consider A the set of afhne maps of R, that is 
A = {f -.x^ ax + b, aeM*, 6 G R}. 

1. Show that A is a group with respect to the composition of maps. 

2. Let 

N = {g : X ^-^ x + h, G M}. 
Show that is a normal subgroup of A. 

3. Show that the quotient group A/N is isomorphic to M*. 
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Answer. 

1. Let /,g e A. Then 

(/ o g){x) = f{ax + b) ~ a {ax + b) + b' = a ax + a'b + 6', 

where a'a G M* thus the elosure property is satisfied. The composition 
of maps is associative. The identity element is given by the identity map 
since 

Ido/ = /oId = /. 

Finally, we need to show that every / e A is invertible. Take = 
a^^x — a^^b. Then 

/-I o f{x) = r'^iax + 6) = a-\ax + 6) - a-^b = x. 

2. Let g G N and let / e A. We have to show that 

/o.go/-l eiV. 

We have 

/ o g{a-^x - a-^b) = f{a-\x) - a-^b + b') ^ x - b + ab' + b e N. 

3. Define the map 

cp : A^R*, f{x) = ax + 6 i-> a. 
It is a group homomorphism since 

f{f°9) = a'a = ip{f)ip{g). 

The kernel of is iV and its image is R* . By the 1st isomorphism theorem, 
we thus have that 

A/N ~ M*. 

Exercise 18. Use the first isomorphism theorem to 
1. show that 



2. show that 
where 

3. compute 
Answer. 



Gi„(M)/S'L„(K) ~M*. 

c7c/~m;, 

U = {zc,C* \ \z\^ 1}. 
M/27rZ. 
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1. Consider the map: 

det : GL„(M) ^ R*, X ^ dct{X). 

It is a group honiomorphism. Its kernel is S'i„(M), its image is M* and 
thus by the 1st isomorphism theorem, we have 

G'i„(R)/S'L„(M) 

2. Consider the map 

exp : C* K;. 

It is a group homomorphism. Its kernel is U, and its image is M.'^ and thus 
by the 1st isomorphism theorem, we have 

c*/u ~ m;. 

3. Define the map 

/:R^C*, .Th^e^^. 

It is a group homomorphism. Its kernel is 27rZ. Its image is {e'^, x G 
M} = U. Thus by the 1st isomorphism theorem 

R/2nZ ~ U. 

Exercise 19. Let G ~ (x) be a cyclic group of order ?7 > 1. Let : Z ^ G, 
m I-)- x™. 

• Show that hx is surjective and compute its kernel. 

• Show that G ~ Z/nZ. 
Answer. 

• Let g G G. Since G = {x), g = for some < fc < n — 1 and thus is 
surjective. Its kernel is the set of m such that x™ = 1, thus m must be a 
multiple of n and Ker[hx) = riZ. 

• By the 1st isomorphism theorem, since is a group homomorphism, we 
have 

G ~ Z/tiZ. 

Exercise 20. Prove the second isomorphism theorem for groups, namely that 
if H and N are subgroups of G. with N normal in G, then 

H/{Hr\N) ~ HN/N. 

Answer. Let tt be the canonical epimorphism from G to G/N, and let ttq be the 
restriction of tt to H. Then the kernel of ttq is HON, so by the 1st isomorphism 
theorem for groups, we have that H/ {H n N) is isomorphic to the image of ttq 
which is {hN, he H}^ HN/N. 
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Exercise 21. Prove the third isomorphism theorem for groups, namely that if 
N and H are normal subgroups of G, with TV contained in H, then 

G/H^{G/N)/{H/N). 

Answer. This follows from the 1st isomorphism theorem for groups, if we can 
find an cpimorphism of G/N into G/H with kernel H/N: take f{aN) = aH . 
Now / well-defined, since if aN = bN, then a~^b G N C H so aH = bH. Since 
a is arbitrary in G, / is surjective. By definition of coset multiplication, / is a 
homomorphism. The kernel is 

{aN, aH = H} = {aN, a e H} = H/N. 

Exercise 22. Consider the short exact sequence of groups 

1^-^A^^B—^G^-^l 
where i is the inclusion and j is the constant map 1. 

1. Show that Im{u) — Keriv) v o u ~ 1, and Ker{v) C Im[u) (1 
denotes the constant map here). 

2. Show that in the short exact sequence, we have that u is injective and v 
is surjective. 

3. Show that u{A) is normal in B and that we have a group isomorphism 

B/u{A) ~ C. 

Answer. 

1. If Im{u) ~ Ker{v), then clearly Ker{v) C Im{u). Then v o u(x) — 
v{u{x)) ~ 1 since u{x) is in the kernel of v. Conversely, we have to show 
that Irniu) C Ker{v). Let u{x) S Im{u). Now v{u{x)) = 1 and thus u{x) 
is in Ker{v). 

2. To show that u is injective, we compute its kernel. Now Ker{u) — Im{i) = 
{1} and u is injective. To show that v is surjective, we have to see that 
Im{v) = G, but Im{v) = Ker{j) = G. 

3. Since Im{u) — Keriv), u{A) is normal in B, and we conclude by the 1st 
isomorphism theorem. 

2.5 Direct and semi-direct products 

Exercise 23. The quaternion group Qg is defined by 
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with product • computed as follows: 

1 ■ a = a ■ I = a, VaG Qs 
(-1) • (-1) = 1, (-1) ■a = a- (-1) = -a,VaeQs 

i ■ i ~ j ■ j ~ k ■ k = —1 
i- j k, j -1 = -k, 
j ■k = i, k- j = -i, 
k ■ i = j, i ■ k = —j. 

Show that Qg cannot be isomorphic to a semi-direct product of smaller groups. 

Answer. By definition, a semi direct product must contain two smaller sub- 
groups of trivial intersection {1}. Now the smaller subgroups of Qs are {1,-1}, 
— j, — 1}, {1, /c, — fc, — 1}, and each contains —1 so that it is 
not possible that Qs is a semi-direct product. 

Exercise 24. Consider the set of matrices 

(where Fp denotes the integers mod p) . 

1. Show that G is a subgroup of 5*^2 (Fp). 

2. Write G as a semi-direct product. 
Answer. 

1. That G is a subset of 5*^2 (Fp) is clear because the determinant of every 
matrix in G is 1. We have to show that for X,Y G G, XY^'^ e G. This 
is a straightforward computation: 

fa b \ ( —d \ I ac~^ —da + bc \ „ 

[O a-^ )[ c )^[ a-^c J^^' 

2. Take 

«7^o. «eFp| 

and 

Both K and H are subgroups of G. Their intersection is the 2-dimensional 
identity matrix, and HK = G, since 



f 1 b \ f a \ _ f a ba-^ \ 
[O I )[0 a-^ )^[0 ) 
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and ba ^ runs through every possible element of Fp (since b does) . Also 
H is normal in G, since 

fa b \ f 1 b \ f a^^ -b \ f 1 a^b\ „ 

Note that K is not normal, which can be seen by doing the same compu- 
tation. Thus G is the semi-direct product of H and K. 

Exercise 25. Show that the group Z„ x Z,„ is isomorphic to Z„i„ if and only 
if m and n are relatively prime. Here Z„ denotes the integers modulo n. 

Answer. If m and n are relatively prime, then for a multiple of (1,0) to be 
zero, it must be a multiple of n, and for a multiple of (0, 1) to be zero, it must 
be a multiple of m. Thus for a multiple k of (1, 1) to be zero, it must be a 
multiple of both n and to, and since they are coprime, the smallest possible 
value of k is mn. Hence Z„ x Z^ contains an element of order mn, showing 
that Z,„ X Z„ is isomorphic to Z„j„. Conversely, suppose that gcd(m,7i) > 1. 
Then the least common multiple of m and n is smaller than toti, let us call it d. 
This shows that every element of Z„i x Z„ has order at most d and thus none of 
them can generate the whole group, so that it cannot be cyclic, and thus cannot 
be isomorphic to Z„j„. 

Note that one can also prove this result by the definition of direct product: 
we have that Z^ and Z„ are both normal subgroups of Zm„ because this is an 
abelian group. We are thus left to look at the intersection of Z^ and Z„. Recall 
that Z„j and Z„ are embedded into Zm„ as respectively 

= {0, f^, 2n, . . . , (m — l)n}, Z„ = {0, to, 2m, . . . ,{n — 1)to}. 

If TO and n are coprime, then Z,„ H Z„ = {0}. Conversely, if x belongs to the 
intersection and is non-zero, then x must be a multiple of both n and m which 
is not congruent to modulo mn, and thus to and n cannot be coprime. 

Exercise 26. Let Z3 denote the group of integers modulo 3. 

1. Show that the map 

cr : Z3 X Z3 ^ Z3 X Z3, (.T, y)^ {x + y, y) 
is an automorphism of Z3 x Z3 of order 3. 

2. Show that the external semi-direct product of Z3 x Z3 and Z3 by p, p : 
Z3 Aut{'Lz X Z3), i f-T- cr% is a non-abelian group G satisfying that 

a^b^ = {abf 

for any a, 6 in G. 
Answer. 
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1. So to be an automorphism, a has to be a group homomorphism, but 

a{{x+x',y+y')) = {x+x' +y+y' ,y+y') = {x+y,y)+{x' +y\y') ^ a{x,y)+(j{ 

It clearly goes from the group to itself, and it is a bijection. It is an 
injection 

a{x, y) = (t{x\ y') ^ {x + y,y) ^ {x' + y' ,y') ^ y ^ y' ,x = x' , 
and thus it is a surjection since the group is finite. It is of order 3, since 
(T{x,y) = {x + y,y), a'^{x,y) = {x + 2y,y), cr^{x,y) = {x + 3y,y) = {x,y). 

2. An element in the external semi-direct product is of the form {{x,y),i), 
and we have 

{{x, y),i){{x, y), i) = {{x, y) + a\x, y),2i), 

i{x,y),i)^ = {{x,y) + a'{x,y) + a^\x,y),3i) 

= i{x,y) + {x + iy,y) + {x + 2iy,y),3i) 
= {{3x + 3iy,3y),3i) 
= ((0,0), 0). 

This shows that for any element a of the semi-direct product a"^ = 0, thus 
= 0, ab is another element of the group thus (ab)^ = which shows 
that a^b^ = = (ab)^, though the group is non-abelian (because a is not 
the identity). 

2.6 Permutations and Group action 

Exercise 27. (*) In a group G of order n, for all divisors d of n, there exists 
at least one subgroup of order d. True or false? [Though the statement only 
involves the order of a group, there were not enough examples of groups seen in 
the lecture notes earlier to come up with a counter-example.] 

Answer. This is false. The smallest counterexample is the alternating group 
G = A4 of even permutation on 4 elements, given explicitly by 

A4 = {e, (12)(34), (13)(24), (14)(23), (123), 

(132), (124), (142), (134), (143), (234), (243)}. 

It has 12 elements. We will now show that it has no subgroup of order 6. Let 
H denote a subgroup of order 6, that is, H has index 2 in A4 and there are only 
two cosets, satisfying 

A4 = HU Ha, 
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for all a G A4 but not in H. Consider the coset Ha^. We have that either Ha^ = 
H or Ha^ = Ha. If Ha^ = Ha, then Ha = H and a G H, a, contradiction. 
Thus 

Ha^ = H, 

for all a £ A4 but not in H. But those a G H, because H is a subgroup, also 
satisfy that Ha^ = H, so that wc deduce that 

Ha^ = H, Va e A4. 

This in turn implies that 

e H Wa e Ai. 

Let now 6 G ^4 be an element of order 3, that is ~ e. Then = b~^ 
and b'^ £ H showing that b"^ £ H and finally b £ H. We have just shown 
that every element of order 3 in A4 are in H, which is contradiction, since A4 
contains 8 elements of order 3. (If you are not satisfied with this proof, please 
check "Variations on a Theme: A4 Definitely Has No Subgroup of Order Six!" 
by M. Brennan and D. Machale, available online, where 12 different proofs are 
provided.) 

Exercise 28. 1. Let G = GL„(C) and X = C" - {0}. Show that G acts on 
Xhy Gx X ^ X, {M, v) ^ Mv. 

2. Show that the action is transitive. 
Answer. 

1. We have to show that 

M ■ (M' ■ v) = {MM') -v, 1g-v^v. 

The first point is clear by properties of matrix vector multiplication. The 
second is also clear since 1g is the identity matrix. 

2. We have to show that there is only one orbit (which is why we have to 
remove the whole zero vector from C"). For that, we need to show that 
for any two vectors v,v' G X, there is a matrix M G G such that Mv = f' . 
We thus have a system of n linear equations for unknowns, so that we 
have enough degrees of freedom to find such a matrix. Alternatively, if 
v = (ai, . . . , a„), v' = (61, . . . , where a^, bi arc all non-zero, take the 
matrix 

diag(a;f\ . . . ,a-^) 

and notice that 

diag(6i, . . . ,&„)diag(aj"\ . . .,a~'^)i' = ly' . 
The case where some a^, bj are zero can be done similarly. 
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Exercise 29. Let G be group, and H he a subgroup of G. Show that 

g-g'H^ gg'H 

defines an action of G on the set G/H of cosets of H . Find the stabilizer of gH . 

Answer. To show that the action is well defined we have to check that it 
does not depend on the choice of the representative, and that it satisfies the 
definition of group action. First suppose that g' H = g" H . We have to show 
that g ■ g"H = gg'H. But g' H = g" H ^ (g")-^g' eH ^ igg'T^dd') e 
H '^==^ gg'H = gg"H. The definition of group action can be checked easily: 

gi ■ (52 ■ g'H) = gi ■ g2g'H = gig2g' H = gig2 ■ g'H, 1 • g' H = g'H. 

The stabilizer of gH is formed by g' such that g'gH = gH that is g^^g'g G H. 
Thus g^^g'g = h, for some ft, e iJ, or equivalently g' = ghg^^ , thus the stabilizer 
is gHg^^. 

Exercise 30. Consider the dihedral group given by 
Dg, = {1, s, r, r'^, r"^, rs, r^s, s\ 
(that is = \,r^ = \ and {rsf = 1). 

1. Divide the elements of the dihedral group into conjugacy classes. 

2. Verify the class equation. 
Answer. 

1. There are 5 conjugacy classes 

{I},{r2},{r,r3},{.s,sr2},{sr, sr^}. 

2. We have that {1} and {j-^} arc in the center. Thus 

\D,\ = 8 = \Z{D,)\ + \B{r)\ + \B{rs)\ + \B{s)\. 

Exercise 31. The quaternion group Qg is defined by 
with product • computed as follows: 

i-a = a- i = a, VaG Qs 
i-l) ■ (-1) = 1, (-1) • a = a • (-1) = -a, V a G Qg 

i ■ i — j ■ j = k ■ k = —1 
i- j = k, j ■{ = -k, 
j ■k = i, k- j ^ -i, 
k ■ i ~ j, i ■ k = — j. 
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1. Show that if X ^ Z{Qs), then \Cq^{x)\ = 4. 

2. Show that as a consequence, the class of conjugacy of a; ^ Z{Ds) has only 
two elements. 

Answer. 

1. The center Z{Qs) is Z{Qs) = {1, —1}. We have by definition that 
Thus 

Cq,{i) = {1,-1, 1, -i}, Cq,U) = {1,-1, J, -j}, CQ,{k) = {1,-1, k,-k}. 

2. When the action is defined by conjugation, we have that Stab(a;) = 
CQg{x). Thus by the Orbit-Stabilizer, the size of an orbit, which is a 
conjugacy class, is 

\B{x)\ = \Qs\/\CQ,ix)\=8/4 = 2. 

Exercise 32. Let G be a group and let H and K be two subgroups of G. 

1. Show that the subgroup H acts on the set of left cosets of K by multipli- 
cation. 

2. Consider the coset IK = K. Compute its orbit B{K) and its stabilizer 
Stab(K). 

3. Compute the union of the cosets in B{K) and deduce how many cosets 
are in the orbit. 

4. Use the Orbit-Stabilizer Theorem to get another way of counting the num- 
ber of cosets in B{K). By comparing the two expressions to count the 
cardinality of B{K), can you recover a known result on the cardinality of 
HK? 

Answer. 

1. Let X = {gK, g G G} be the set of left cosets of K. We have to check 
that h' ■ {h ■ gK) = {h'h) ■ gK which clearly holds, as does 1^ • gK = gK. 

2. We have that B{K) = {h- K, he H} and Stab(A') = {h e H, h-K = 
K) =HnK. 

3. The union of the cosets in B{K) is HK, the cosets in B{K) are disjoint 
and each has cardinality K, so that we have |iJA' cosets in B{K). 
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4. By the Orbit-Stabilizer Theorem, we have 



\B{K)\ = |iJ|/|Stab(/0| ^ \HK\/\K\ = \H\/\HnK\ 



and thus 



\HK\ 



\H\\K\ 

\HnK\' 



Exercise 33. Let G be a finite group, and let p be the smallest prime divisor 
of the order of G. 

1. Let iJ be a normal subgroup of G. Show that G acts on H by conjugation. 

2. Let H he a normal subgroup of G of order p. 

• Show that the orbits of H under the action of G are all of size 1. 

• Conclude that a normal subgroup H of order p is contained in the 
center of G. 



1. We check the definition, that is, the group G acts on H if for the map 
{g,x) 1-^ g ■ X = gxg~^, x £ H, defined from G x H ^ H (note that we 
need here H normal to guarantee that gxg~^ e HI), we have 

• h ■ (g ■ x) ^ h ■ [gxg^^] = h(gxg^^)h^^ = (hg) ■ x 

• I ■ X = X for all X Cz H 

2. • By the orbit stabilizer theorem, the size of an orbit B{x),x £ H 

divides the size of G, the group that acts on H, thus if is 
not 1, it must be at least p, since p is the smallest divisor of the 
order of G. Now the orbits partition H, that is H = \JB{x) and 
thus \H\ = J2 \B{x)\, that is the sum of the cardinals of the orbits is 
\H\ = p. Among all the B{x), we can take x = 1 E H since H is a 
subgroup. The orbit B{1) = {g ■ 1, g & G} = {glg^^ = 1} has only 
1 element, there is at least one orbit of size 1, and thus no orbit can 
have size greater or equal to p, since then p + I > p. Thus all orbits 
of H are of size 1 . 

• We have that B{x) = {g ■ x,g ^ G} — {gxg~'^,g e G} is always 
of size 1, and since for g = 1 e G we have x € B{x), we deduce 
that B{x) = {x}, that is gxg~^ = x, or gx = xg showing that for 
all X € H , X actually commutes with every g € G, that is, H is 
contained in the center. 

Exercise 34. Let G be a group acting on a finite set X. 



Answer. 



1. We assume that every orbit contains at least 2 elements, that |G| = 15, 
and that \X\ = 17. Find the number of orbits and the cardinality of each 
of them. 
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2. We assume that \G\ = 33 and \X\ = 19. Show that there exists at least 
one orbit containing only 1 element. 

Answer. 

1. The cardinal of every orbit divides the order of G. Furthermore, the sum 
of the orbit cardinalities is equal to the cardinality oi X. If jGI = 15, 
\X\ = 17, and there is no orbit of size 1, there is only one possibility: 4 
orbits of length 3 and 1 of length 5. Indeed, we are looking for integers 
such that their sum is 17, but each integer must divide 15, that is we need 
to realize 17 as a sum of integers belonging to {3, 5, 15} (1 is excluded by 
assumption). Then 15 is not possible, and we can use only 3 and 5: 15+2 
is not possible, 10+7 is not possible, so only 5+12 works. 

2. Now |G| = 33 and \X\ ^ 19. The divisors of 33 arc 1,3,11 and 33. We 
need to obtain as above 19 as a sum of these divisors. 33 is too big, and 
we cannot possibly use only 11 and 3. Thus there must be at least one 
orbit of size 1 . 

Exercise 35. Let G be a finite group of order n > 1 and let p be a prime. 
Consider the set 

X = {x = {gi,g2,-- ■,gp) € G^ | 51 • 52 • • - .9^ = 1g}- 

1. Compute the cardinality \X\ of the set X. 

2. Show that if (171, . . . ,gp) G X, then (52, ■ • ■ i5pj5i) G X. Denote by a the 
corresponding permutation. Show that < a > acts on X as follows: 

o''' ■ {gi, ■■■,gp) ^ (3<t'=(i), ■ • ■ ,5<Tfc(p)), k 

3. What is the cardinal of one orbit of X7 

4. What are the orbits with one clement? Show that there is at least one 
such orbit. 

5. Deduce that if p docs not divide n, then 

n^~^ = 1 mod p. 

6. Deduce Cauchy Theorem from the above, namely, if p | n then G has at 
least one element of order p. 

Answer. 

1. Since gi, . . . ,5p-i can take any value in G (only gp is constrained so as to 
have gi- g2---gp^ Iq), we have \X\ = |G|P"1 = n^-^ 

2. Since (ffi, . . . , 5p) € X, then ffi • 32 • • • .9p = 1g and 52 • • • 5p • ffi = Oi^ ■ 
1(3 • gi showing that (172, ■ • • , 5p, gi) € X. To show that < ct > acts on X, 
check the definition, namely ct' • (it'' • (gi, . . . ,gp)) = (t^(t^ ■ {gi, . ■ ■ ,gp) and 

• {gi,---,gp) = (.9i,---,5p)- 
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3. The answer is either 1 or p. There are two ways to do it: one can notice 
that < a > has order p, and thus by the Orbit-Stabihzer Theorem the 
size of the orbit divides p, so it can be either 1 or p. Also one can just 
write down the definition of one orbit: the orbit of (gi, . . . ,gp) is formed 
by all the shifts of the components, and thus since p is prime, there will 
be p distinct shifts, apart if all the components are all the same, in which 
case there is only one element in the orbit. 

4. Since an clement always belongs to its orbit, we have that orbits with one 
element are of the form B{x) = {x}, and if there is only one element, 
that means that the shifts are doing nothing on a; = ((71,..., pp) thus 
X = (g, . . . , (7) and since x €z X, that further means that gP = Iq. To 
show one such orbit exists, take the orbit of (1, . . . , 1). 

5. Since the orbits partition X, wc have 

\X\^J2\Bix)\+Y,\Bix')\ 

where the first sum is over orbits of size 1, and the second over orbits of 
size greater or equal to 2. By the above, if the size is at least 2, it is p, and 
thus |i?(a;')| = mod p. Then if there were more than (1, . . . , 1) with 
orbit of size 1, that means an element g such that g^ = 1, which would 
mean p\n, a contradiction. Thus only there is only one orbit of size 1, and 

\X\==nP-^ = l mod p. 

6. Again, we have that 

nP~' = \X\=J2\B{x)\+J2\B{x')\ 

and if p|n then = ^ 1^(2;) | and there must be at least another element 
with orbit size 1, that is an element g of order p. 

2.7 The Sylow theorems 

Exercise 36. Let G be a group of order 399. 

1. Show that G has a unique Sylow 19-subgroup P which is normal in G. 

2. Let Q be a Sylow 7-subgroup. Show that N = PQ is a subgroup of order 
133 of G. 

Answer. 

1. The number nig of Sylow 19-subgroups is = 1 mod 19 and divides 21, 
thus it must be 1. Since it is unique, it has to be normal. 
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2. Since P is normal in G, we have that N = PQ is a subgroup of G. 
(The fact that P is normal can be used to check directly the definition of 
subgroup). By the 2nd isomorphism theorem for groups, we have 

g/(0 n P) ~ PQ/P PQ/P 

since Q n P = {1} so that \PQ\ |P| |Q| = 19 • 7 = 133. 

Exercise 37. Let G be a simple group of order 168. 

1. Compute its number of Sylow 7-subgroups. 

2. Deduce the number of elements of order 7 in G. 
Answer. 

1. Since 168 = 2"^ • 3- 7, the number of Sylow 7-subgroups must be = 1 mod 7 
and must divide 24. The only possibilities are thus 1 and 8. Since G is 
simple, it cannot be 1. 

2. Elements of order 7 correspond to generators of the cyclic Sylow 7-subgroups, 
and there are 6 of them per Sylow 7-subgroup, that is 6 x 8 = 48 elements 
of order 7. 

Exercise 38. (*) This exercise aims at classifying groups of order up to 8. 

1. For p prime, show that any group G with cardinality is abelian. 

2. For p an odd prime, show that any non-abelian group G of order 2p is 
isomorphic to the dihedral group Dp. 

3. Determine all the finite groups of order at most 8 up to isomorphism. 
Answer. 

1. Let Z{G) be the center of G. We have by Lagrange Theorem that |^(G)| 
divides p^ , thus we have 3 cases: 

• |^(G)| = 1: we know that the center of a p-group cannot be trivial, 
thus this case cannot happen. 

• |^(G)| ~ p^: then clearly G is abelian. 

• \Z{G)\ = p: then |G/Z(G)| = p^/p = p, and then quotient group IS 
cyclic, and we already showed (see Exercise 15) that in this case it 
implies that G is abelian. 

2. Let G be a group of order 2p. Then G contains an element a of order p, 
and an element /3 of order 2, by Cauchy Theorem. We will prove 

{a,P)c^Dp. 
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• First we have that |(a,/3)| = 2p. Indeed, it must divide 2p, and it 
must be great than p, since (a) has ah'eady cardinahty p and does 
not contain (3. 

• We now show that 

Since (a) is normal in G, then 

for some i. Since = 1 

and = 1 mod p. If z = 1 mod p, then a and /3 commute, so G 
cannot be non-abelian. Thus i = —\ mod p as wanted. 

• FinaUy, it is enough to conchidc to show that 



is an isomorphism, where a is a rotation of angle 27: /p and 6 is a 
reflection. This map is surjective, and both sets have same size, so it 
is injective. It is also clear that / is a group homomorphism. 

3. • For prime order \G\, we have that G is cyclic, so that gives C2, C3, 



• For \G\ — A, we already know that either G contains an clement of 
order 4, and G = C4, ot G has only elements of order 2 (except of 
course the identity), and then G = C2 x C2. 

• If |G| = 6, if G contains an element of order 6, then G ~ Cq. If not, 
then G must contain 1 element of order 3, and one of order 2, and 
by the above computations, we have G = Dq. 

• If |G| = 8, then elements in G can have order 2,4 and 8. If there is an 
element of order 8, then G = Cs- If all elements have order 2, then 
G is abelian (namely G2 x G2 x G2), so for G to be non-abelian, we 
must have an clement of order 4, say g. Now we have 1, g, g^,g^ G G. 
If there is an element h of order 2, h not in (g), then hah~^ has order 
4, and repeating the above computations, we can see that G = Dg. 
If such an h does not exist, then all elements not in (g) have order 4. 
Let k be such an element of order 4, then has order 2 and must 
then be g^. In this case, we obtain G = Q%. Finally, if G has an 
element of order 4 and is abelian but not cyclic, then G = G2 x G4. 

Exercise 39. Consider the set of matrices of the form 



/(a"6^) 



G5, G7. 
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where a, 6, c are integers modulo 2. Show that this set forms a group and 
compute its cardinaUty. Can you identify this group? 

Answer. This set forms a group under matrix multipHcation. Matrix mul- 
tiphcation is clearly associative. One can check that the product of two such 
matrices still belongs to the set. The identity element is given by the identity 
matrix. Every matrix is clearly invertible with inverse 





a 
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1 



















a 






a 
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1 
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Since a, b, c are integers modulo 2, there are 8 possible such matrices. Thus it is 
a group of order 8. It is not commutative, thus it is either D4 or Qg. We note 
that 

' 1 



thus 



is an element of order 4, and the subgroup it generates is 
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So when a = and c = 1, or a = 1 and c = 0, we see that we get elements of 
order 2 not in this subgroup, thus it must be 1)4. (This line of argument comes 
from the above exercise where we did the classification of groups of order 8.) 
Alternatively, we can notice that 




is of order 2, and the intersection of this subgroup of order 2 with that above 
of order 4 is the identity. Again this cannot happen with Qg since we know 
the intersection of any two subgroups cannot be trivial (this was the line of 
argument used to show that Qg cannot be a semi-direct product). 

Exercise 40. Let G be a finite p-group, and let H he a normal subgroup of G. 
Show that H D Z{G) cannot be trivial (where Z{G) denotes the center of G). 
Is it still true when G is an infinite p-group?[this is harder!] 



Answer. The subgroup H is normal, thus G acts by conjugation on H. Since 
\G\ = for some r, the size of the non-trivial orbits is divisible by p. Since 
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G is a p-group, H is also a p-group, with say \H\ = for some s < r. We 
deduce that the union of orbits has size p"^ , and since the orbits of size > 2 are 
divisible by p, the union of trivial orbits, that is of orbits of fixed points, is also 
of cardinality divisible by p. But the union of orbits of fixed points is HC\Z{G). 
Since it contains the identity clement, it at least contains p elements and cannot 
be trivial. 

This is not true anymore when the group is infinite. Take for example the 
group 

G = U C/(p,n), 

n>l 

where U {p, n) is the group of n x n upper triangular matrices with every diagonal 
coefficient at 1 and elements of Fp discussed in Example 1.31. This is an infinite 
p-group. Now the center Z(U{p,n)) of U{p^n) is a cyclic group of order p, 
formed by matrices with every diagonal coefficient at 1, one element of Fp in 
position (l,n) and zeros elsewhere. Thus the center of the infinite p-group G 
which is itself the limit of the U{p,n) when n grows will be contained in the 
limit of these centers, which is in fact the trivial group (that is the limit of the 
identity matrix when n tends to infinity). 

Exercise 41. Let G be a group of order 57 which is not cyclic. 

1. Compute the number of its Sylow 19-subgroups. 

2. Deduce the number of elements of order 3 in G. 
Answer. 

1. Since 57 = 3 • 19, ni9|3 and 7219 = 1 mod 19, showing that 77,19 = 1- 

2. Let a 7^ 1 be in G. Then \a\ = 3 or \a\ = 19 (|a| 7^ 57 since G is not cyclic 
and \a\ ^ 1 since a ^ 1). Since there is only one Sylow 19-subgroup, which 
is cyclic, it contains 18 elements of order 19, which are all the elements of 
order 19 of G. Thus there are exactly 38 elements of order 3 in G. 

Exercise 42. Let G be a group of order 231. 

1. Show that G as a unique Sylow 11-subgroup M. 

2. Compute the number 717 of its Sylow 7-subgroups. 

3. Let P be a Sylow 3-subgroup and L be a Sylow 7-subgroup. Show that 
N PL forms a subgroup of order 21. 

4. Show that N is furthermore normal. (*) 

5. Deduce from the above that G can be written as MN . 

6. Prove that the Sylow 11-subgroup M belongs to the center of G. 



Answer. 
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1. We have that 

231 = 11 • 3 • 7. 
Thus rill = 1 niod 11 and rin | 21. Thus it must be 1. 

2. Again, ny = 1 mod 7 and ny divides 33. Thus it must be 1. 

3. Notice first that since L is a unique Sylow 7-subgroup, it is normal. We 
have to sec that PL is a subgroup. Let pi be an element of PL. Its inverse 
is (pl)^^ = l^^p^^ and since L is normal, wc know that gL = Lg for every 
g in G. Thus in particular l~^p^^ G Lp^^ = p~^L and l^^p~^ = p~^l' 
showing that the inverse belongs to PL. The same argument works to 
show that plp'l' E PL. Wc know that Ip' = p'l" by normality of L, thus 
p{lp')l' = p[p'l")l' e PL and 1 is in PL. Now about the order of PL, 
since \P\ = 3 and \L\ = 7, and P O L = {1} because they are groups of 
order respectively 3 and 7, we have that \PL\ = |P||L|/|Pn L| = 21. 

4. This is the most difficult part. The most likely quickest way to do it is by 
using the normalizer Ng{N) of N. It is a subgroup of G, thus its order 
divides \G\. Since is contained in its normalized, \Ng{N)\ > 21. So if we 
can prove that \Ng{N)\ > 21, we are done, because then \Ng{N)\ = 231 
and the normalizer is G, which yields the desired conclusion. To show 
that \Ng{N)\ > 21, it is enough to exhibit at least one element in the 
normalizer which is not in TV. This is not the most economical way of 
doing it, but one way is to just get an element in the center. By Cauchy 
Theorem, we know G contains an element g of order 3, and since M is 
the unique Sylow 11-subgroup, it is a normal subgroup of G, which is 
furthermore cyclic, generated by say m. We thus have that gM = Mg 
that is gm ~ m}' g and we are left to show that I is one. Since g is of order 
3, then m = g^mg'^ = g'^{gmg-^)g-'^ = g'^m'-g-'^ = g{gm'-g-^)g-^ = 
g{m}yg~^ = m} . This shows that = \ mod 11. But this is possible 
only \il = 1 (gcd(3, 10) — 1) and m belongs to the center. 

5. That MN \s a. group works as above, because TV is normal. That the 
cardinality is right also works as above, since \M\ = 11 and \N\ = 21, thus 
their intersection is 1, and thus |Af = 231, from which it follows that 
MN = G. 

6. We first observe that M commutes with N . since if m e M, n £ N, 
then by normality of M (it is a unique Sylow 11-subgroup) we have that 
mnm~^n^^ £ M D N = {1} implying that mn = nm. To show that M 
commutes with G, we use the fact that G = MN, and thus every element 
g € G can be written as g = m'n' . Now gm = {m'n')m = m'mn' since M 
and N commute, and m'mn' ~ mm'n' = mg since M is abelian (it is in 
fact cyclic). 
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2.8 Simple groups 

Exercise 43. Show that no group of order 200 is simple. 

Answer. The number of Sylow 5-subgroups of a group of order 200 = 5^ • 2"^ is 
= 1 mod 5 and divides 8, thus it must be 1. Thus the unique Sylow 5-subgroup 
is normal and thus the group cannot be simple. 

Exercise 44. (*) Let G be a group such that \G\ < 59. Show that if G is 
simple, then |G| is prime. 

Answer. We know that if \G\ = pq, p,q two distinct primes, then G is not 
simple. This can be extended by noting that if \G\ = p'^m, where k > and 
(to,p) = 1, m < p. Indeed, in this case, Up must divide mp*', and thus must 
divide m. But also. Hp = 1 mod p, so that if rip 7^ 1 then Up > p + 1 > m, 
which contradicts that np\m. Wc are thus left to check that groups of order 

12,24, 30,36,40,45,48,56 

are not simple. We know G with \G\ = 45 = 5 • 3^ is not simple since G is not 
simple when \G\ = p^q. If \G\ ~ 40, then the number of Sy- low 5-subgroups 
is congruent to 1 modulo 5 and divides 8. The only possibility is = 1, and 
G has a normal Sylow 5-subgroup. If \G\ = 56 = 2"^ • 7, then the number nj of 
Sylow 7-subgroups must divide 8 and = 1 mod 7. If ny = 1, G is not simple. 
If riy ~ 8, then we get 6 • 8 = 48 elements of order 7, and only 8 elements are 
left in the group not of order 7, which correspond to the Sylow 2-subgroup of 
size 8. Similarly if |G| = 12, there are 713 = 1 mod 3 Sylow 3-subgroups, where 
rt3 1 12, so this can be 1 or 4. If 71,3 = 1 G is not simple. If 713 = 4, we get 4-2 = 8 
elements of order 3. The other 4 elements must be part of the Sylow 2-subgroup 
which is of order 4. We are thus left with 

24, 30, 36, 48 

One way to take care of 24, 36 and 48 at once is to prove that the order of 
G divides 7ip!/2. Otherwise it can be done case by case. |G| = 30 is done 
individually. 

Exercise 45. Let G be a group of order 105. Prove that it is impossible that 
|Z(G)|=7. 

Suppose to the contrary that \Z{G)\ = 7, then \G\/\Z{G)\ = 15 and thus 
G/Z{G) is a group of order 15. Since 15 = 3 • 5, that is p = 3, <; = 5, with p 
which does not divide g — 1 = 4, then G/Z{G) is cyclic (by Proposition 1.39) 
and thus G is abelian (by Exercise 15). 

Exercise 46. Let G be a group, H a subgroup, and consider Nc{H) = {g E 
G, fjH = Hg}. 
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1. Show that the number of conjugates of U in G is equal to the index of 
Ng{H) in G. 

2. Deduce a formula for the number of Sylow p-subgroups of G. 

3. Use the above to show that a simple group G of order 60 cannot have a 
subgroup of order 20. 

Answer. First recall that Ng{H) is a subgroup of G (see Exercise 6). 

1. Let G act by conjugation on the set X of its subgroups. By the orbit- 
stabilizer theorem, we have that \B{H)\ = \G\/\NGiH)\ since NoiH) is 
the stabilizer of H. Now \B{H)\ is the number of conjugates of H, which 
proves the claim. 

2. Let _ff be a Sylow p-subgroup of G. We know by the Sylow theorems that 
all Sylow p-subgroups are conjugate, thus Hp = \B{H) \ and Hp is the index 

3. Assume by contradiction that K is a subgroup of order 20, then K has a 
unique Sylow 5-subgroup L which is then normal, and thus K C Nq{L). 
Now the order of K must divide the order of Ng{L) and the order of G, 
and since \K\ = 20, we have that \Ng{L)\ is 20 or 60: 

• if it is 60, then the index of Nq[L) is 1, and there is a unique Sylow 
5-subgroup in G, which is then normal, and contradicts the simplicity 
of G. 

• if it is 20, then the index is 3, and there are 3 Sylow 5-subgroups 
in G, that is ns = 3 which contradicts the Sylow theorems: = 1 
mod 5 

Exercise 47. (*) Let G be a simple group of order 60, which thus cannot 
contain a subgroup of order 20. 

1. Show that if G contains a subgroup K of order 12. then K contains 4 
Sylow 3-subgroups. 

2. Show that if H and K are two distinct subgroups of order 4 of G, then 
HnK ^ {1}. 

Answer. 

1. K is of order 12, thus its number of Sylow 3-subgroups is congruent to 
1 mod 3, so 

= 1 or = 4 

(it cannot be more since K is of order 12). To show that it must be 4, 
we show that it cannot be 1. Let us assume that K has a unique Sylow 
3-subgroup L, then L must be normal in K, then K must be contained 
in Ng{L) = {g E G, gL = Lg}, and since Ng{L) is a subgroup of G, its 
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order must divide 60, and also be divisible by the order of K which is 12. 
Thus 

|7Vg(L)| = 12 or 60. 

Its index in G is then respectively 5 or 1. Since its index is also the 
number of Sylow 3-subgroups (because they are all conjugate, and using 
the Orbit Stabilizer theorem), that means that we have either 1 or 5 
Sylow 3-subgroups in G, which is impossible; 1 is impossible since G is 
simple, and 5 is impossible since \G\ = 60, and thus its number of Sylow 
3-subgroups is congruent to 1 mod 3. 

2. The intersection of two subgroups is a subgroup, thus its order is either 
1,2 or 4. It cannot be 4 since they are distinct, to prove that it is 1, let 
us assume by contradiction that it is 2, thus a cyclic group of order 2: 

HnK ^ (a) 

and = 1. Since the order of H is 4, the index of (a) in H is 2, and 
(a) is normal in H, that is hah^^ = a and H is contained in Gg{o) = 
{g G G, ga = ag}. The same holds for K and since both H and K are in 
Caia), so must be their union H U K. Now the size of _ff U isT is 6, and 
thus |CG(a)| > 6 and must be divisible by 4. The possibilities are 

12,20,60. 

Now 20 is impossible, by what is mentioned in the statement of the ex- 
ercise. 12 is also impossible by the previous question: then Caia) would 
have 4 Sylow 3-subgroups, showing that there should be only 1 Sylow 2- 
subgroup; but both H and K are in Caia)- This leaves 60: in that case, 
we would get that (a) is a normal subgroup, and thus it should be the 
unique Sylow 2-subgroup, which is also a contradiction. 

2.9 The Jordan-Holder Theorem 

Exercise 48. Prove that every finite group has a composition series. 

Answer. Take the longest possible subnormal series of G, say 

{1} = G„ < Gn-i <•••<] Go = G 

which is possible since G is finite. Then the composition factors are all simple. 
Indeed, if there exists a composition factor Gi/G^+i which is not simple, then it 
has a non-trivial normal subgroup, which by the correspondence theorem gives 
a normal subgroup between Gi and G^+i. This extends the subnormal series 
assumed to be the longest possible, thus a contradiction. The longest possible 
subnormal series is then a composition series. 
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Exercise 49. Prove that the infinite cyclic group G has no composition series. 

Answer. Let G ~ (g) where \g\ = oo. Suppose there exists a composition series 
for G, that is 

{1} - G„ < G„_i <•••<] Go - G. 
It cannot be that n = 1, that is 

{l} = Go<]G 

because that would mean that G is simple, and clearly G is not simple, because 
for example (g^) is a proper normal subgroup of G. Thus n > 2, and the 
composition series contains a subgroup G„_i different than {1} and G. Then 
G„_i is a non-trivial subgroup of G, which means it is of the form G„_i ~ {g^) 
for some positive integer k. But then G„_i is an infinite cyclic group, so it 
cannot be simple, which contradicts the definition of composition series. 

Exercise 50. Show that the group GL„(M) has a subnormal series, but no 
composition series. 

Answer. The series 

{1} < Z(GL„(]R)) < 5L„(R) < GL„(M) 

is a subnormal scries, however GL„(M) cannot have a composition series, be- 
cause it contains a normal subgroup isomorphic to the infinite cyclic group 
(look at matrices of the form a scalar times the identity matrix, which clearly 
commute with every matrix in GL„(M)). 

2.10 Solvable and nilpotent groups 

Exercise 51. Consider the general dihedral group 

D2n = {a,h\ a" = 6^ ^ 1, h-^ab = a^^}. 
Is solvable? Prove your answer. 

Answer. We have seen several equivalent definitions of G solvable. One of 
them is that G is solvable if and only if there exists a normal series 

{1} = Go < Gi < G2 < • • • < G„ = G 

such that Gi+i/Gi is abelian. The rotations (a) form a subgroup of order n, this 
is not the case of the reflections (composing two reflections gives a rotation). 
We have that (a) is normal. So the series 

{1} < (a) < D2n 

is normal and the quotient I?2n/(a) has order two hence is the cyclic group of 
order 2 which is abelian. 
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Exercise 52. Consider thel dihedral group 

De = {a, 5 I = = I, b^^ah = a^^}, 

which was shown to solvable in the previous exercise. Is Dq nilpotent? Prove 
your answer. 

Answer. It is not, because the number n2 of its Sylow 2-subgroups is 3. Indeed, 
we know that 77,2 = 1 mod 2, and 712 = mod 3. This contradicts the fact that 
a finite nilpotent group must have a unique Sylow p-subgroup for every p. 

Exercise 53. Show that the nilpotency class of the quaternion group Q% is 2. 
Answer. A central series for Qg is 

{l} = G2<Gi = {l,-l}<Go = Q8. 

Indeed, we notice that Gi is the center of Qg, thus it is normal in Qg, which 
shows that this is a normal series. Now for i = 1, we have that G1/G2 — 
{1,-1} C Z(G/G2) ~ {1,-1}. For i = 0, we have that Go/Gi ~ G/Z(G), 
since G/Z(G) is of order 4 and cannot be cyclic (otherwise Qs would be abelian) 
it must be the Klein group, which is commutative (anyway knowing that the 
quotient group is of order 4 is enough to conclude the commutativity) . Thus 
G/Z{G) C Z{G/Z{G)). Finally, the central series cannot be shortened, since 
Qg is not abelian, thus the nilpotency class of Qs is 2. 

Exercise 54. (*) Let G be a group of order 16, which contains an element g 
of order 4. Show that (g^) is normal in G. 

Answer. There are two cases: 

1. If (g) is normal in G, then (g^) is also normal in G, since it is characteristic 
in G (this can be checked by the definition, take any automorphism / of 
(g), it thus maps some to some g^ and thus fiig'^Y) = — (.9"')^ ^ 

2. If {g) is not normal in G, then the subgroup 

H = {heG, h{g) = {g}h} 

(called the normalizer of (g)) will be order 8. Indeed iJ is a subgroup of 
G, thus \H\ divides 16, that is \H\ = 1,2,4,8, 16, and \H\ > \{g)\, so that 
\H\ = 4,8,16. It cannot be 16, since otherwise (g) would be normal in 
G. Finally, if \H\ = 4, then that would mean that (g) = H. This is not 
possible either (see result on nilpotent groups (currently Prop 1.50) in the 
lecture notes). If we can now prove that (5^) is characteristic in H, then 
we are done, since then (g^) is characteristic in H which is normal in G 
(it is of index 2 in G) thus (g^) is normal in G. 
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We are thus left to prove that if is a group of order 8, with a cycUc group (g) 
of order 4, then (_g^) is characteristic in H. 

1. If iJ is abehan, then H is isomorphic to Cs or to (g) x C2. In both cases 
it is characteristic (in the first case, (g^) is a subgroup of a cychc group, 
thus it is characteristic). 

2. If H is not abehan, then its center Z{H) is of order 2 (the order of the 
center has to divide \H\, if it were 4 or 8, then H/Z{H) would be of 
order 2 or 1, thus cyclic, which implies that H is abelian). Furthermore, 
the center intersects non trivially every non-trivial normal subgroup of H. 
Thus Z{H) = {g^) since this is the only subgroup of (g) of order 2 and (g) 
is normal in H (it is of index 2). Finally Z{H) is a characteristic subgroup 
of H which concludes the proof. 

Exercise 55. True/False. 

Ql. There are 3 kinds of groups of order 4, up to isomorphism. 

Q2. Let H and K be two subgroups of G. Then HK is a subgroup of G. 

Q3. Let G be a group, and let X be a set. Then the orbit B(x) of x in X under 
the action of G is a subgroup of G. 

Q4. The dihedral group -Dio of order 20 is simple. 

Q5. The dihedral group D3 is isomorphic to the symmetric group S3. 

Q6. Let H and N be two subgroups of G, with N normal in G. Then the 
following two quotient groups are isomorphic: HN/N ~ H/N. 

Q7. Every simple p-group G is abelian. 

Q8. The number of elements in any conjugacy class of a finite group G divides 
the order of G. 

Q9. The Klein group is a 2-group. 

QIO. Let G be a cyclic group of order n. Then there is a subgroup of size d for 
each positive divisor d of n. 

Answer. 

Ql. False. There are only the Klein group C2 x C2 and the cyclic group C4. 

Q2. False. Indeed, you need the extra condition that HK = KH for it be 
true! If you try to find an inverse for hk, you will see you cannot find 
it. Of course, in G it should be k^^h^^ , but without the assumption that 
HK = KH, this element has no reason to live in HK. 

Q3. False. For B{x) to be a subgroup of G, then B{x) at least need to contain 
the identity element 1. However gx has no reason to be 1 in general, since 
X belongs to X which is an arbitrary set. 
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Q4. False. Remember that Dio has order 2 • 10 = 20. Now 20 = (2^) ■5=p'^q 
and we have seen that groups of such order cannot be simple. 

Q5. True. One can for example check out the multiplicatfon table for both 
groups. 

Q6. False. In fact H/N is not even properly defined since N has no reason to 
be included in H to start with. 

Q7. True. G is a p-group, thus its center is non-trivial (result proved in the 
lecture notes). The center of G is always normal in G. Now G is simple, 
thus its normal subgroups are only {1} and G. Thus either the center is 
{1} or it is G. It cannot be one since it's non-trivial, thus it is G and G 
is abelian. 

Q8. True. Use the Orbit-Stabilizer theorem to deduce that the number of 
elements in an orbit divides the order of the group, and now notice that 
a conjugacy class is nothing else than an orbit when G acts on itself by 
conjugation. 

Q9. True. The Klein group is of order 4, and is actually isomorphic to C2 x C2. 
All its elements are order 2. so it is indeed a 2-group. 

QIO. True. It is not true in general for an arbitrary group, but it is true for 
cyclic groups. Indeed, take g to be the generator of G of order n. Now 
if d divides n, then n = kd for some k. Take the subgroup generated by 
. Clearly {g^Y = 1 since = 1. There cannot be a c?' < d such that 
{g^Y = Ij otherwise this would mean the order of G is < n. 
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In the first section below, a ring will be defined as an abstract structure with 
a commutative addition, and a multiplication which may or may not be com- 
mutative. This distinction yields two quite different theories: the theory of 
respectively commutative or non-commutative rings. These notes are mainly 
concerned about commutative rings. 

Non-commutative rings have been an object of systematic study only quite 
recently, during the 20th century. Commutative rings on the contrary have 
appeared though in a hidden way much before, and as many theories, it all goes 
back to Fermat's Last Theorem. 

In 1847, the mathematician Lame announced a solution of Fermat's Last 
Theorem, but Liouville noticed that the proof depended on a unique decompo- 
sition into primes, which he thought was unlikely to be true. Though Cauchy 
supported Lame, Kummer was the one who finally published an example in 
1844 (in an obscure journal, rediscovered in 1847) to show that the uniqueness 
of prime decompositions failed. Two years later, he restored the uniqueness by 
introducing what he called "ideal complex numbers" (today, simply "ideals") 
and used it to prove Format's Last Theorem for all n < 100 except n = 37, 59, 
67 and 74. 

It is Dedekind who extracted the important properties of "ideal numbers" , 
defined an "ideal" by its modern properties: namely that of being a subgroup 
which is closed under multiplication by any ring clement. He further introduced 
prime ideals as a generalization of prime numbers. Note that today we still 
use the terminology "Dedekind rings" to describe rings which have in particu- 
lar a good behavior with respect to factorization of prime ideals. In 1882. an 
important paper by Dedekind and Weber developed the theory of rings of poly- 
nomials. At this stage, both rings of polynomials and rings of numbers (rings 
appearing in the context of Fermat's Last Theorem, such as what we call now 
the Gaussian integers) were being studied. But it was separately, and no one 
made connection between these two topics. Dedekind also introduced the term 
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"field" (Korper) for a commutative ring in which every non-zero element has a 
multiplicative inverse but the word "ring" is due to Hilbert, who, motivated by 
studying invariant theory, studied ideals in polynomial rings proving his famous 
"Basis Theorem" in 1893. 

It will take another 30 years and the work of Emmy Noether and Krull to see 
the development of axioms for rings. Emmy Noether, about 1921, is the one who 
made the important step of bringing the two theories of rings of polynomials 
and rings of numbers under a single theory of abstract commutative rings. 

In contrast to commutative ring theory, which grew from number theory, 
non-commutative ring theory developed from an idea of Hamilton, who at- 
tempted to generalize the complex numbers as a two dimensional algebra over 
the reals to a three dimensional algebra. Hamilton, who introduced the idea of 
a vector space, found inspiration in 1843, when he understood that the gener- 
alization was not to three dimensions but to four dimensions and that the price 
to pay was to give up the commutativity of multiplication. The quaternion 
algebra, as Hamilton called it, launched non-commutative ring theory. 

Other natural non-commutative objects that arise arc matrices. They were 
introduced by Cayley in 1850, together with their laws of addition and multi- 
plication and, in 1870, Pierce noted that the now familiar ring axioms held for 
square matrices. 

An early contributor to the theory of non-commutative rings was the Scottish 
mathematician Wedderburn, who in 1905, proved "Wedderburn's Theorem", 
namely that every finite division ring is commutative and so is a field. 

It is only around the 1930's that the theories of commutative and non- 
commutative rings came together and that their ideas began to influence each 
other. 

3.1 Rings, ideals and homomorphisms 

Definition 3.1. A ring R is an abelian group with a multiplication operation 

(a, 6) ab 

which is associative, and satisfies the distributive laws 

a{b + c) = ab + ac, (a + b)c = ac + be 
with identity element 1. 

There is a group structure with the addition operation, but not necessarily 
with the multiplication operation. Thus an element of a ring may or may not be 
invertible with respect to the multiplication operation. Here is the terminology 
used. 

Definition 3.2. Let a, 6 be in a ring R. If a 7^ and b ^ but ab = 0, then 
we say that a and b are zero divisors, li ab = ba ~ 1, we say that a is a unit or 
that a is invertible. 



3.1. RINGS, IDEALS AND HOMOMORPHISMS 



99 



While the addition operation is commutative, it may or not be the case with 
the multipHcation operation. 

Definition 3.3. Let R be ring. If ab = ba for any a, b in R, then R is said to 
be commutative. 

Here are the definitions of two particular kinds of rings where the multipli- 
cation operation behaves well. 

Definition 3.4. An integral domain is a commutative ring with no zero divisor. 
A division ring or skew field is a ring in which every non-zero element a has an 
inverse a~^. A field is a commutative ring in which every non-zero element is 
invertible. 

Let us give two more definitions and then we will discuss several examples. 

Definition 3.5. The characteristic of a ring R, denoted by chari?, is the small- 
est positive integer such that 

n-l = l + l + ... + l = 0. 

V > 

ntimcs 

If there is no such positive integer, we say that the ring has characteristic 0. 

We can also extract smaller rings from a given ring. 

Definition 3.6. A subring of a ring i? is a subset S R that forms a ring 
under the operations of addition and multiplication defined in R. 

Examples 3.1. 1. Z is an integral domain but not a field. 

2. The integers modulo n form a commutative ring, which is an integral 
domain if and only if n is prime. 

3. For n > 2, the n x n matrices A^„(M) with coefficients in M. arc a non- 
commutative ring, but not an integral domain. 

4. The set 

Z\i] = {a + bi, a, be Z}, i"^ = -1, 
is a commutative ring. It is also an integral domain, but not a field. 

5. Let us construct the smallest and also most famous example of division 
ring. Take l,i,j,k to be basis vectors for a 4-dimensional vector space 
over R, and define multiplication by 

i'^ = = = —1, i'i = k, jk = i, ki = j, ji = —ij, kj = —jk, ik = —ki. 
Then 

M = {a + bi + cj + dk, a, b,c,de M} 
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forms a division ring, called the Hamilton's quaternions. So far, we have 
only seen the ring structure. Let us now discuss the fact that every non- 
zero element is invertible. Define the conjugate of an element ft, = a + &i + 
cj + dk & M to he h = a — bi — cj — dk (yes, exactly the same way you did 
it for complex numbers). It is an easy computation (and a good exercise 
if you are not used to the non-commutative world) to check that 

qq^a^ + + c^ + d^ . 

Now take q~^ to be 

-1 9 
q = — • 

Clearly qq^^ = q^^q = 1 and the denominator cannot possibly be 0, but 
iia = b = c = d = 0. 

6. If i? is a ring, then the set R[X] of polynomials with coefficients in i? is a 
ring. 

Similarly to what we did with groups, we now define a map from a ring to 
another which has the property of carrying one ring structure to the other. 

Definition 3.7. Let R, S be two rings. A map f : R ^ S satisfying 

1. f{a + b) = f{a) + f{b) (this is thus a group homomorphism) 

2. f{ab) = f{a)f{b) 

3. f{lR) = Is 

for a, 6 e i? is called ring homomorphism. 

We do need to mention that /(li?) = Is, otherwise, since a ring is not 
a group under multiplication, strange things can happen. For example, if Zg 
denotes the integers mod 6, the map / : Zg — > Zg, n t-)- 3n satisfies that 
/(to + n) = 3(m + n) = 3to + 3n = f(rn) + f{n), and f{n)f(m) = StoStt. — 
Smn ~ f{mn) but /(I) ^ 1 and / is not a ring homomorphism. Notice the 
difference with group homomorphism: from /(a + b) = f{a) + f{b), we deduce 
that /(a -h 0) = /(a) + /(O), that is /(a) = /(a) + /(O). Now because /(a) is 
invertible, it must be that /(O) = 0! Once we reach /(a) = /(a)/(l), because 
/(a) does not have to be invertible, we cannot conclude! 

The notion of "ideal number" was introduced by the mathematician Kum- 
mer, as being some special "numbers" (well, nowadays we call them groups) 
having the property of unique factorization, even when considered over more 
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general rings than Z (a bit of algebraic number theory would be good to make 
this more precise). Today only the name "ideal" is left, and here is what it gives 
in modern terminology: 

Definition 3.8. Let I be a subset of a ring R. Then an additive subgroup of 
R having the property that 

ra for a G I, r Cz R 

is called a left ideal of R. If instead we have 

ar G I for a G I, r G i? 

we say that we have a right ideal of R. If an ideal happens to be both a right 
and a left ideal, then we call it a two-sided ideal of R, or simply an ideal of R. 

Example 3.2. The even integers 2Z = {2n, n G Z} form an ideal of Z. The 
set of polynomials in M.[X] with constant coefficient zero form an ideal of M[X]. 

Of course, for any ring R, both R and {0} are ideals. We thus introduce 
some terminology to precise whether we consider these two trivial ideals. 

Definition 3.9. We say that an ideal I of i? is proper ii I R. We say that 
is it non-trivial ii I ^ R and 1^0. 

If / : i? — > S* is a ring homomorphism, we define the kernel of / in the most 
natural way: 

Ker/ = {r G i?, /(r) = 0}. 

Since a ring homomorphism is in particular a group homomorphism, we already 
know that / is injective if and only if Ker/ = {0}. It is easy to check that Ker/ 
is a proper two-sided ideal: 

• Ker/ is an additive subgroup of R. 

• Take a G Ker/ and r e R. Then 

/M - f{r)f{a) - and f{ar) - f{a)f{r) = 
showing that ra and ar are in Ker/. 

• Then Ker/ has to be proper (that is, Ker/ ^ R), since /(I) = 1 by 
definition. 

We can thus deduce the following (extremely useful) result. 

Lemma 3.1. Suppose f : R ^ S is a ring homomorphism and the only two- 
sided ideals of R are {0} and R. Then f is injective. 

Proof. Since Ker/ is a two-sided ideal of R, then either Ker/ = {0} or Ker/ = R. 
But Ker/ ^ R since /(I) = 1 by definition (in words, Ker/ is a proper ideal). □ 



102 



CHAPTER 3. RING THEORY 



At this point, it may be worth already noticing the analogy between on the 
one hand rings and their two-sided ideals, and on the other hand groups and 
their normal subgroups. 

• Two-sided ideals are stable when the ring acts on them by multiplication, 
either on the right or on the left, and thus 

rar^^ G I, a £ I, r £ R, 

while normal subgroups are stable when the groups on them by conjuga- 
tion 

ghg-^ eH, he H, g e G {H < G). 

• Groups with only trivial normal subgroups are called simple. We will not 
see it formally here, but rings with only trivial two-sided ideals as in the 
above lemma are called simple rings. 

• The kernel of a group homomorphism is a normal subgroup, while the 
kernel of a ring homomorphism is an ideal. 

• Normal subgroups allowed us to define quotient groups. We will see now 
that two-sided ideals will allow to define quotient rings. 

3.2 Quotient rings 

Let I be a proper two-sided ideal of R. Since T is an additive subgroup of R 
by definition, it makes sense to speak of cosets r +1 of I, r G R. Furthermore, 
a ring has a structure of abelian group for addition, so T satisfies the definition 
of a normal subgroup. From group theory, we thus know that it makes sense to 
speak of the quotient group 

Rll = {r + X, r e R}, 

group which is actually abelian (inherited from R being an abelian group for 
the addition). 

We now endow R/X with a multiplication operation as follows. Define 

{r + I){s+I) = rs+I. 

Let us make sure that this is well-defined, namely that it does not depend on 
the choice of the representative in each coset. Suppose that 

r +1 = r' +1, s +1 = s' +1, 

so that a = r' — r E I and b ~ s' ~ s E I. Now 

r' s' = (a + r){b + s) = ah + as + rb -\- rs € rs + I 
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since ab, as and rb belongs to X using that a,b G I and the definition of ideal. 
This tells us r's' is also in the coset rs + X and thus multiplication does not 
depend on the choice of representatives. Note though that this is true only 
because we assumed a two-sided ideal X, otherwise we could not have concluded, 
since we had to deduce that both as and rb arc in I. 

Definition 3.10. The set of coscts of the two-sided ideal I given by 

R/I = {r+I, r eR} 
is a ring with identity 1/? + 1 and zero element O^ -I- 1 called a quotient ring. 

Note that we need the assumption that I is a proper ideal of R to claim that 
R/X contains both an identity and a zero element (if R ^ I, then R/I has only 
one element). 

Example 3.3. Consider the ring of matrices 7M2(IF2[i]), where F2 denotes the 
integers modulo 2, and i is such that = —1 = 1 mod 2. This is thus the ring 
of 2 X 2 matrices with coefficients in 

F2H = {a + ib, a,b£ {0,1}}. 

Let I be the subset of matrices with coefficients taking values and 1 + i only. 
It is a two-sided ideal of A^2(F2[i]). Indeed, take a matrix U G I, a matrix 
M g A^2(lF2[*])i and compute UM and MU. An immediate computation shows 
that all coefficients are of the form a(l + i) with a S F2[j], that is all coefficients 
are in {0, 1 -I- 1}. Clearly I is an additive group. 
We then have a quotient ring 

M2i¥2[i])/I. 

We have seen that Ker/ is a proper two-sided ideal when / is a ring homo- 
morphism. We now prove the converse. 

Proposition 3.2. Every proper two-sided idealT is the kernel of a ring homo- 
morphism. 

Proof. Consider the canonical projection tt that we know from group theory. 
Namely 

TT : i? — > r h-)- 7r(r) = r +1. 

We already know that tt is group homomorphism, and that its kernel is X. We 
are only left to prove that tt is a ring homomorphism: 

• since I is two-sided, then R/X is a ring. 

• 7r(rs) = rs +X ~ {r +I){s +1) = 7r(r)7r(s). 

• 7r(lfl) = Iji+I which is indeed the identity element of R/I. 
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We are now ready to state a factor theorem and a 1st isomorphism theorem 
for rings, the same way we did for groups. It may help to keep in mind the 
analogy between two-sided ideals and normal subgroups mentioned above. 

Assume that we have a ring R which contains a proper two-sided ideal I, 
another ring S', and / : i? — )■ 5 a ring homomorphism. Let tt be the canonical 
projection from R to the quotient group R/I: 




S 



We would like to find a ring homomorphism / : — > S that makes the 
diagram commute, namely 

/(a) = /Xa)) 

for all a E R. 

Theorem 3.3. (Factor Theorem for Rings). Any ring homomorphism f 
whose kernel K contains X can he factored through R/T. In other words, there 
is a unique ring homomorphism / : — > 5 such that / o tt = /. Furthermore 

1. f is an epimorphism if and only if f is. 

2. f is a monomorphism if and only if K = X. 

3. f is an isomorphism if and only if f is an epimorphism and K =1. 

Proof. Since we have already done the proof for groups with many details, here 
we will just mention a few important points in the proof. 

Let a + 1 G R/I such that 7r(a) = a + 1 for a e i?. We define 

f{a + I) = f{a). 

This is the most natural way to do it, however, wc need to make sure that this 
is indeed well-defined, in the sense that it should not depend on the choice of 
the representative taken in the coset. Let us thus take another representative, 
say b € a + X. Since a and b arc in the same coset, they satisfy a — 6 £ I C JC, 
where K — Ker(/) by assumption. Since a — b Cz wc have f{a — b) = and 
thus /(a) = f{b). 

Now that / is well defined, it is an easy computation to check that / inherits 
the property of ring homomorphism from /. 

The rest of the proof works exactly the same as for groups. □ 

The first isomorphism theorem for rings is similar to the one for groups. 

Theorem 3.4. (1st Isomorphism Theorem for Rings). If f : R ^ S is a 

ring homomorphism with kernel K, then the image of f is isomorphic to R/K: 

Iraif) - i?/Ker(/). 
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Proof. We know from the Factor Theorem that 

/ : i?/Ker(/) ^ S 

is an isomorphism if and only if / is an cpimorphism, and clearly / is an epi- 
morphism on its image, which concludes the proof. □ 

Example 3.4. Let us finish Example 3.3. We showed there that Ai2{^2[i]) 
is a quotient ring, where I is the ideal formed of matrices with coefficients in 
{0, 1 + i}. Consider the ring homomorphism: 

/ : X2(F2W) ^ M2{¥2),M = (mkj) ^ f{M) = (m^,, mod 1 + i) 

that is / looks at the coefhcicnts of M mod 1 + i. Its kernel is I and it is 
surjective. By the first isomorphism for rings, we have 

M2{¥2[i])/Io^M2{¥2). 

Example 3.5. A less exotic example, which we will study in more details later 
on, is the following. Consider the map / : M.[X] — C, f{p{X)) = p{i), that is, 
/ takes a polynomial p{X) with real coefficients, and evaluate this polynomial 
in i (J? = — 1). This map is surjective (take the polynomial p{X) = X -\- [z — i), 
z G C) and its kernel is formed by polynomials which, when evaluated in i, 
are giving 0, meaning that i is a root of the polynomial, or cquivalently that 
(X^ + 1) is a factor of the polynomial. Thus Ker(/) = (X^ + 1)M[X] = {p{X) = 
{X"^ + 1)(7(X), qiX) € R[X]}. Using the first isomorphism for rings, we have 

M\X\I{X'^ + 1)R[X] ~ C. 

3.3 The Chinese Remainder Theorem 

The name "Chinese Remainder Theorem" supposely comes from the following 
question: How many soldiers were part of Han Xing's army if, sorted by 3 
columns, 2 soldiers were left, sorted by 5 columns, 3 soldiers were left, and 
sorted by 7 columns, 2 soldiers were left. 

The Chinese Remained Theorem is attributed to Sun Zi (in the 3rd century) , 
and was later published by Qin Jiushao (around 1247). 

We will prove a "general" Chinese Remainder Theorem, rephrased in terms 
of rings and ideals. 

For that let us start by introducing some new definitions about ideals, that 
will collect some of the manipulations one can do on ideals. Let us start with 
the sum. 

Definition 3.11. Let I and J be two ideals of a ring R. The sum of I and J 
is the ideal 

T + J = {x^y, .tGT, yd J}. 
If I and J are right (rcsp. left) ideals, so is their sum. 
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Note that the intersection Tn J" of two (resp. right, left, two-sided) ideals 
of R is again a (resp. right, left, two-sided) ideal of R. 

Definition 3.12. The product of two left (resp. right) ideals I and J' is the 
left (resp. right) ideal 

n 

IJ = x,yi, Xi € J, 2/i e J}. 

4 = 1 

Example 3.6. Take I = 2Z and J = 3Z which are both two-sided ideals of Z. 
We have 

i: + J = {2x + iy, X, yeZ}=Z, 
using Bezout identity (since gcd(2,3) = 1). Also 

n 

lnj = 6Z, IJ ^ 2xi3yi, x,ye1] = 6Z. 
1=1 

We can define a notion of being co-prime for ideals as follows. 

Definition 3.13. The two-sided ideals I and J' of a ring R are relatively prime 
if 

I + J = R. 

In a sense, this definition generalizes Bezout identity for rings. 

Notice that for a commutative ring, if T and J are relatively prime then 

XJ =If^J. 

(This is also illustrated in the above example.) Indeed, we clearly have that 

IJ ClIC^J 

since TJ contains by definition sums of elements xy, x ^X^y ^ J , with xy 
and xy £ J hy definition of two-sided ideal. Conversely 

injcij 

since there exist x € I,y €z J such that x + y = 1 hy definition of relatively 
prime, and for every element a € T (1 J, we have that 

a = a{x + y) = ax + ay ~ xa + ay ^ TJ . 

For R a non-commutative ring, where I, J are two-sided and co-prime, all we 
can say is that 

Xr\J^TJ + JX. 

Indeed, a(x + y) = ax + ay ^ JX + XJ since ax ^ xa. 
Finally, let us extend the notion of "modulo" to ideals. 
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Definition 3.14. If a,b G R and T is an ideal of R, we say that a is congruent 
to b modulo X if 

A last definition this time about rings is needed before we can state the 
theorem. 

Definition 3.15. If i?i,...,i?„ arc rings, the direct product of i?i,...,i?„, 
denoted by HILi^*' defined as the ring of n-tuples (ai,...,a„), g i?^, 
with componentwise addition and multiplication. The zero element is (0, ... , 0) 
and the identity is (1, . . . , 1) where 1 means 1^^ for each i. 

This definition is an immediate generalization of the direct product we stud- 
ied for groups. 

Theorem 3.5. (Chinese Remainder Theorem). Let R be a commutative 
ring, and let Xi, . . . be ideals in R, such that 

!,+!,= R, i^j. 

1. If ai, . . . ,an are elements of R, there exists an element a di R such that 

a = ai mod 1^, i — 1, . . . ,n. 

2. If b is another element of R such that b = ai mod 1^, i ~ I, . . . ,n, then 

b = a mod 0"^^ Z;. 

Conversely, if b satisfies the above congruence, then b = Ui mod li, i = 
l,...,n. 

3. We have that 

n 

R/n2=lI^0^Y[R/I^. 

i=l 

Proof. 1. For j > 1, we have by assumption that Ii + Xj = R, and thus there 
exist bj G Xi and dj G X, such that 

bj + dj ^ 1, i = 2, . . . ,n. 

This yields that 

n 

\{{b,+d,)^l. (3.1) 

Now if we look at the left hand side of the above equation, we have 
(62 + rf2)(fc3 + c?3) ■ ■ ■ {K + d„) - i hbs + b2d3 + d2b3 +d2d3) ■ ■ ■ (6„ + d„) 
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and all the terms actually belong to Ti, but ci := 11^=2 '^j ^ Y[j=2-^3- 
Thus 

Ci = 1 mod Ii 
from (3.1). On the other hand, we also have 

ci = mod 2j 

for j > 1 since ci e 11^=2 -^J ■ 

More generally, for all i, we can find Ci with 

Ci = 1 mod 1;, Ci = mod X,-, j 7^ i. 

Now take arbitrary elements ai, . . . , a,i G R, and set 

a = flici + . . . + a„c„. 

Let us check that a is the solution we are looking for. Since Cj = mod Ij 
,j ^ I, we have for a given i that 

a = a^Ci = mod Ti 

using that Ci = 1 mod Z^. 

2. We have just shown the existence of a solution a modulo Ti for i ~ 1, . . . ,n. 
We now discuss the question of unicity, and show that the solution is 
actually not unique, but any other solution than a is actually congruent 
to a mod n"^^ Xi. 

We have for alH = 1, . . . , n that 

b = ai mod Xi b = a mod Xi b — a = mod Xi 

which finally is equivalent to 

b-ae n^^iT,. 

3. Define the ring homomorphism f : R ^ Yii=i R/^i^ sending 

a ^ f{a) = {a + Xi, . . . ,a+Xn). 

• This map is surjective: take any (ai . . . , a„ +I„) £ Y[i=i ^/-^i^ 
then we must find an a G i? such that /(a) = (oi . . . ,a„ +Xn), 
that is a + Ij = +1^, or equivalently ai = a mod 1;, which is true 
by the first point. 

• Its kernel is given by 

Ker/ = {aeR, f{a) = (Ii,...,I„)} 
= {a e i?, CL <EXi, i = 1, . . . , 7T,} 

i=l 

We conclude using the first isomorphism Theorem for rings. 
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□ 

Example 3.7. If i? = Z, the Chinese Remainder Theorem simphfies to say 
that if n = J^,- rii where the rii are coprime, then 

l/nZ ~ Y[Z/n,Z. 

i 

In the particular case of Example 3.6, we have 

Z/6Z ~ Z/2Z X Z/3Z. 

This version of the Chinese remainder Theorem does not hold in the non- 
commutative case, because the property that IJ^ = Xr\ J does not hold any- 
more, as pointed out earlier. There is though a commutative version if all the 
co-prime ideals are assumed to be two-sided. 

3.4 Maximal and prime ideals 

Here are a few special ideals. 

Definition 3.16. The ideal generated by the non-empty set X of i? is the 
smallest ideal of R that contains X. It is denoted by (X). It is the collection 
of all finite sums of the form ViXiSi. 

Definition 3.17. An ideal generated by a single element a is called a principal 
ideal, denoted by (a). 

Definition 3.18. A maximal ideal in the ring i? is a proper ideal that is not 
contained in any strictly larger proper ideal. 

One can prove that every proper ideal is contained in a maximal ideal, and 
that consequently every ring has at least one maximal ideal. We skip the proof 
here, since it heavily relies on set theory, requires many new definitions and the 
use of Zorn's lemma. 

Instead, let us mention that a correspondence Theorem exists for rings, the 
same way it exists for groups, since we will need it for characterizing maximal 
ideals. 

Tiieorem 3.6. (Correspondence Theorem for rings). //I is a two-sided 
ideal of a ring R, then the canonical map 

TT : R^ R/I 

sets up a one-to-one correspondence between 

• the set of all subrings of R containing! and the set of all subrings of R/T, 

• the set of all ideals of R containing X and the set of all ideals of R/T. 
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Here is a characterization of maximal ideals in commutative rings. 

Theorem 3.7. Let M be an ideal in the commutative ring R. We have 

M maximal R/M is a field. 

Proof. Let us start by assuming that M is maximal. Since R/M is a ring, we 
need to find the multiplicative inverse oi a+AI G R/M assuming that a+M ^ 
in R/M , that is a ^ M . Since M is maximal, the ideal Ra + M has to be R 
itself, since M (ZRa + M. Thus l& Ra + M = R, that is 

1 = ra + m, r G i?, m G M. 

Then 

(r + M){a + M) = ra + M ^{l-m) + M = 1 + M 

proving that r + M is (a + M)^^. 

Conversely, let us assume that R/M is a field. First we notice that AI must 
be a proper ideal of R, since if M = R, then R/M contains only one element 
and 1 0. 

Let N be an ideal of R such that M C N C R and N ^ R. We have to 
prove that M = to conclude that M is maximal. 

By the correspondence Theorem for rings, we have a one-to-one correspon- 
dence between the set of ideals of R containing M, and the set of ideals of R/M. 
Since N is such an ideal, its image 7r(iV) G R/M must be an ideal of R/M, and 
thus must be either {0} or R/M (since R/M is a field). The latter yields that 
N ^ R, which is a contradiction, letting as only possibility that 7r(A^) = {0}, 
and thus N ^ M, which completes the proof. □ 

Definition 3.19. A prime ideal in a commutative ring i? is a proper ideal P 
of R such that for any a,b € R, we have that 

a6 G P ^ a G P or & G P. 

Here is again a characterization of a prime ideal P of P in terms of its 
quotient ring R/P. 

Theorem 3.8. If P is an ideal in the commutative ring R 

P is a prime ideal R/P is an integral domain. 

Proof. Let us start by assuming that P is prime. It is thus proper by definition, 
and R/P is a ring. We must show that the definition of integral domain holds, 
namely that 

(a-fP)(5-fP) = + P^a + P = Por& + P = P. 



Since 



(a + P){b + P) = ab + P = + P, 
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we must have ab £ P, and thus since P is prime, either a E P or b G P, implying 
respectively that either a + P = Poib + P = P. 

Conversely, if R/P is an integral domain, then P must be proper (otherwise 
1 = 0). We now need to check the definition of a prime ideal. Let us thus 
consider ab € P, implying that 

(a + P){b + P) = ab + P = + P. 

Since R/P is an integral domain, cither a + P = Poi-b + P = P, that is 

ae P or b e P, 

which concludes the proof. □ 

Corollary 3.9. In a commutative ring, a maximal ideal is prime. 

Proof. If M is maximal, then R/M is a field, and thus an integral domain, so 
that M is prime. □ 

Corollary 3.10. Let f : R ^ S be an epimorphism of commutative rings. 

1. If S is a field, then Kcr/ is a maximal ideal of R. 

2. If S is an integral domain, then Ker/ is a prime ideal of R. 
Proof. By the first isomorphism theorem for rings, we have that 

S ~ R/Kerf. 

□ 

Example 3.8. Consider the ring Z[X] of polynomials with coefficients in Z, and 
the ideal generated by the indeterminate X, that is {X) is the set of polynomials 
with constant coefficient 0. Clearly {X) is a proper ideal. To show that it is 
prime, consider the following ring homomorphism: 

^ : Z[X] ^ Z, f{X) ^ ^{f{X)) = /(O). 

We have that {X) = Kciip which is prime by the above corollary. 

3.5 Polynomial rings 

For this section, we assume that i? is a commutative ring. Set R[X] to be the 
set of polynomials in the indeterminate X with coefficients in R. It is easy to 
see that R\X] inherits the properties of ring from R. 

We define the evaluation map E^, which evaluates a polynomial f{X) E 
R[X] in X G R, as 

E.:, : R[X] ^ R, f{X) ^ f{X)\x=, = f{x). 
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We can check that E^ is a ring homomorphism. 

The degree of a polynomial is defined as usual, that is, iip{X) ~ ao + aiX + 
. . . + a„X" with a„ 7^ 0, then deg{p{X)) = degp = n. By convention, we set 
deg(O) = —00. 

Euclidean division will play an important role in what will follow. Let us 
start by noticing that there exists a polynomial division algorithm over 
namely: if f,g € R[X], with g monic, then there exist unique polynomials q 
and r in R[X] such that 

/ = 9.9 + r, degr < degg. 

The requirement that g is monic comes from R being a ring and not necessarily 
a field. If i? is a field, g does not have to be monic, since one can always multiply 
g by the inverse of the leading coefficient, which is not possible if R is not a 
field. 

Example 3.9. Take f{X) = - 2 and g{X) = 2X - I. It is not possible to 
divide f{X) by g{X) in Z[X]. If it were, then 

fix) =X''-2 = {qo + qiX){2X - 1) + ro 

and the coefficient of X^ is 1 on the left hand side, and 2qi on the right hand 
side. Now in Z, there is no solution to the equation 2qi = 1. Of course, this is 
possible in Q, by taking qi = 1/2! 

This gives the following: 

Theorem 3.11. (Remainder Theorem). If f E R[X], a E R, then there 
exists a unique polynomial q{X) G R[X] such that 

f{X)=q{X){X ~a) + f{a). 

Hence f{a) = 4=^ X -a \ f{X). 

Proof. Since {X — a) is monic, we can do the division 

f{X)=q{X){X-a) + r{X). 

But now since degr < deg(X — a), r{X) must be a constant polynomial, which 
implies that 

/(«) = r{X) 

and thus 

f{X)=q{X){X-a) + f{a) 
as claimed. Furthermore, we clearly have that 

/(a)=0 ^ X-a\ f{X). 

□ 
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The following result sounds well known, care should be taken not to gener- 
alize it to rings which are not integral domain! 

Theorem 3.12. // R is an integral domain, then a non-zero polynomial f in 
R[X] of degree n has at most n roots in R, counting multiplicity. 

Proof. If / has no root in -R[X], then we are done. Let us thus assume that / 
has a root ai in R, that is /(ai) = 0. Then 

X-ai I f(X) 

by the remainder Theorem above, meaning that 

fiX) = q,iX)iX~a^r 

where 91(01) 7^ and deggi = n — ni since R is an integral domain. Now if 
fli is the only root of / in i?, then ni < n and we are done. If not, consider 
similarly 02 7^ ai another root of /, so that 

= /(a2) -gi(a2)(a2-ai)"^ 

Since R is an integral domain, we must have that 91(02) = 0, and thus 02 is a 
root of qi{X). We can repeat the process with qi{X) instead of f(X): since 02 
is a root of qi{X), we have 

q^iX) ^ q2{X){X - 

with 92(02) 7^ and deg92 = 7^ — ?^l — ?i2- By going on iterating the process, 
we obtain 

fix) = q,{X){X^a,r 

= q2{X){X-a2T'{X-air' 

= {X- {X ~ ■■■{X- auT" ■ c{X) 

where c{X) is a polynomial with no root in i?, possibly constant, and 

n>ni + n2 + --- + nfc. 

Since R is an integral domain, the only possible roots of / are oi, . . . , a^, fc < n, 
and the number of roots counting multiplicity is less than n. □ 

Example 3.10. Take i? = Zg the ring of integers modulo 8. Consider the 
polynomial 

f{X)^X\ 

It is easy to check that is has 4 roots: 0, 2, 4, 6. This comes from the fact that 
Zg is not an integral domain. 
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3.6 Unique factorization and Euclidean division 

In this section, all rings arc assumed to be integral domains. 

Let us start by defining formally the notions of irreducible and prime. The 
elements a, 6, c, u in the definitions below all belong to an integral domain R. 

Definition 3.20. The elements a, b are called associate if a — ub for some unit 
u. 

Definition 3.21. Let a be a non-zero element which is not a unit. Then a is 
said to be irreducible if a = 5c implies that either & or c must be a unit. 

Definition 3.22. Let a be a non-zero element which is not a unit. Then a is 
called prime if whenever a \ be, then a \ b or a \ c. 

Between prime and irreducible, which notion is the stronger? The answer is 
in the proposition below. 

Proposition 3.13. If a is prime, then a is irreducible. 

Proof. Suppose that a is prime, and that a = be. We want to prove that either 
6 or c is a unit. By definition of prime, we must have that a divides either b or 
c. Let us say that a divides b. Thus 

b = ad ^ b = bed ^ b{l - ed) = ^ cd = 1 

using that R is an integral domain, and thus c is a unit. The same argument 
works if we assume that a divides c, and we conclude that a is irreducible. □ 

Example 3.11. Consider the ring 

R = Z[V^] = {a + ibVs, a,b e Z}. 
We want to see that 2 is irreducible but not prime. 

• Let us first check that 2 is indeed irreducible. Suppose that 

2 = {a + ibVi){e + idVi). 
Since 2 is real, it is equal to its conjugate, and thus 

22 = (a + ibS){e + idS){a - ibV3){e - idVS) 

implies that 

4= (a^ -f 362)(c2 -fSd^). 

We deduce that + 35^ must divide 4, and it cannot possibly be 2, since 
we have a sum of squares in Z. If + 36^ = 4, then -I- 3d^ = 1 and 
d = 0, c = ±1. Vice versa if -|- 3d^ = 4 then -I- 3&^ = 1, and 6 = 0, 
a = ±1. In both cases we get that one of the factors of 2 is unit, namely 
±1. 
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• We now have to see that 2 is not a prime. Clearly 

2 I {l + iV3){l-iV3) =4. 

But 2 divides neither 1 + i-\/3 nor 

We can see from the above example that the problem which arises is the lack 
of unique factorization. 

Definition 3.23. A unique factorization domain (UFD) is an integral domain 
R satisfying that 

1. every element 7^ a G i? can be written as a product of irreducible factors 
Pi, . . .p„ up to a unit u, namely: 

a = upi . ..pn- 

2. The above factorization is unique, that is, if 

a^upi...pn= vqi ...Qm 

are two factorizations into irreducible factors pi and qj with units u,v, 
then n = m and pi and Qi are associate for all i. 

We now prove that the distinction between irreducible and prime disappear 
in a unique factorization domain. 

Proposition 3.14. In a unique factorization domain R, we have that a is 
irreducible if and only if a is prime. 

Proof. We already know that prime implies irreducible. Let us show that now, 
we also have irreducible implies prime. 

Take a to be irreducible and assume that a \ be. This means that be — ad 
for some d Cz R. Using the property of unique factorization, we decompose d, b 
and c into products of irreducible terms (resp. di, bi, Ci up to units u,v,w): 

a ■ udi ■ ■ ■ dr = vbi ■ ■ - bs ■ wci . . .Cf. 

Since the factorization is unique, a must be associate to some either bi or 
implying that a divides b or c, which concludes the proof. □ 

We now want to connect the property of unique factorization to ideals. 

Definition 3.24. Let ai,a2,... be elements of an integral domain R. If the 
sequence of principal ideals 

(ai) C (aa) C {a^) C ... 

stabilizes, i.e., we have 

(a„) = (a„+i) = . . . 

for some n, then we say that R satisfies the ascending chain condition on prin- 
cipal ideals. 

If the same condition holds but for general ideals, not necessarily principal, 
we call R a Noctherian ring, in honor of the mathematician Emmy Noether. 
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Examples 3.12. 1. Consider the polynomial ring in infinitely many inde- 
terminates Xi,X2, ■ ■ ■ over R. The chain 

(Xi) C iXi,X2) C (Xi,X2,X3) C ... 

of non-principal ideals is ascending and docs not terminate. The ideal 
generated by all indeterminates is maximal. 

2. Consider the polynomial ring Z + XQ[X] of all rational polynomials with 
integral constant term. The chain 

{X) C {X/2) C (X/4) C . . . 

of principal ideals is ascending and does not terminate. 

Theorem 3.15. Let R be an integral domain. 

1. If R is a UFD, then R satisfies the ascending chain condition on principal 
ideals. 

2. If R satisfies the ascending chain condition on principal ideals, then every 
non-zero element of R can be factored into irreducible ( this says nothing 
about the unicity of the factorization). 

3. If R is such that every non-zero element of R can be factored into irre- 
ducible, and in addition every irreducible element is prime, then R is a 
UFD. 

Thus R is a UFD if and only if it satisfies the ascending chain condition on 
principal ideals and every irreducible element of R is prime. 
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Proof. 1. Recall that in a UFD, prime and irreducible are equivalent. Con- 
sider an ascending chain of principal ideals 

(ai) C (as) C (aa) C ... 

We have that ai+i | for all i. Thus the prime factors of a^+i consist of 
some (possibly all) prime factors of a^. Since oi has a unique factorization 
into finitely many prime factors, the prime factors will end up being the 
same, and the chain will stabilize. 

2. Take 7^ ai G i?. If ai is irreducible, we are done. Let us thus assume 
that oi is not irreducible, that is 

oi = 0262 

where 02 and &2 arc not unit. Since 02 | ai, wc have (ai) C (02), and 
actually 

(fli) C (as). 

Indeed, if (ai) — {02), then 02 would be a multiple of ai, namely 02 = cai 
and thus 

ai = a252 =4> Oi = cai^s =^ ai(l ~ C62) = 

implying that 062 = 1 and thus 62 is a unit. This contradicts the as- 
sumption that ai is not irreducible. This computation has shown us that 
whenever we get a factor which is not irreducible, we can add a new princi- 
pal ideal to the chain of ideals. Thus, if 02^2 is a product of irreducible, we 
are done. Otherwise, we have that say 02 is not irreducible, and 02 = 0363, 
yielding 

(ai) C (as) C (as). 

Since R satisfies the ascending chain condition on principal ideals, this 
process cannot go on and must stop, showing that we have a factorization 
into irreducible. 

3. We now know that R allows a factorization into irreducible. We want to 
prove that this factorization is unique, under the assumption that every 
irreducible is prime. Suppose thus that 

a = UP1P2 ■ ■ ■ p„ = vqiq2 ■ ■ ■ qm 

where u, v are units and pi, qj are irreducible, pi is an irreducible but also 
a prime by assumption, thus it must divide one of the qj, say qi, and we 
have qi = pid. Since qi is irreducible, d must be a unit, and qi and pi are 
associate. We can iterated the process to find that qi and pi are associate 
for all i. 

□ 

We now introduce a notion stronger than being a unique factorization do- 
main. 
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Definition 3.25. A principal ideal domain (PID) is an integral domain in which 
every ideal is principal. 

Theorem 3.16. A principal ideal domain R is a unique factorization domain. 
Proof. What we will prove is that if i? is a principal ideal domain, then 

• R satisfies the ascending chain condition on principal ideals. 

• every irreducible in R is also prime. 

Having proved these two claims, we can conclude using the above theorem. 

Let us first prove that R satisfies the ascending chain condition on principle 
ideals. Consider the following sequence of principal ideals 

(fli) C (a2) C (as) . . . 

and let X = U°^i{ai). Note that I is an ideal of R (be careful, a union of ideals 
is not an ideal in general!). Indeed, we have that X is closed under addition: 
take a, G I, then there are ideals (a^) and (afe) in the chain with a G (aj) and 
b € (flfe). If TO > max(j, k), then both a, 6 € (om) and so do a + b. To check that 
I is closed under multiplication by an element of R, take again a € I. Then 
a €E (flj) for some j. If r e i?, then ra S (oj) implying that ra G T. 

Now by assumption, I is a principal ideal, generated by, say b: X ^ (b). 
Since b belongs to U°^i{ai), it must belongs to some (a„). Thus I = (b) C (a„). 
For j > n, we have 

(flj) C I C (a„) C (aj) 

which proves that the chain of ideal stabilizes. 

We are left to prove that every irreducible element is also prime. Let thus 
a be an irreducible element. Consider the principal ideal (a) generated by a. 
Note that (o) is a proper ideal: if (a) = R, then 1 G (a) and thus a is a unit, 
which is a contradiction. 

We have that (a) is included in a maximal ideal X (this can be deduced from 
either the ascending chain condition or from the theorem (Krull's theorem) that 
proves that every ideal is contained in a maximal ideal). Since i? is a principal 
ideal domain, we have that X = (6). Thus 

(a) <Z (b) ^ b \ a ^ a = bd 

where a is irreducible, b cannot be a unit (since I is by definition of maximal 
ideal a proper ideal), and thus d has to be a unit of R. In other words, a and b 
are associate. Thus 

(a)=I=(6). 



Since I is a maximal ideal, it is prime implying that a is prime, which concludes 
the proof. □ 
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Determining whether a ring is a principal ideal domain is in general quite 
a tough question. It is still an open conjecture (called Gauss's conjecture) to 
decide whether there are infinitely many real quadratic fields which are principal 
(we use the terminology "principal" for quadratic fields by abuse of notation, it 
actually refers to their ring of integers, that is rings of the form cither 7L\\f3\ if 

d = 2 or 3 mod 4 or else). 

One way mathematicians have found to approach this question is to actually 
prove a stronger property, namely whether a ring i? is Euclidean. 

Definition 3.26. Let R be an integral domain. We say that i? is a Euclidean 
domain if there is a function from i?\{0} to the non-negative integers such 
that 

a ~ bq + a,b ^ R, b ^ 0, q,r E R 
where either r = or 5'(r) < ^'(6). 

When the division is performed with natural numbers, it is clear what it 
means that r < b. When we work with polynomials instead, we can say that 
degr < dcgfe. The function generalizes these notions. 

Theorem 3.17. If R is a Euclidean domain, then R is a principal ideal domain. 

Proof. Let I be an ideal of R. If I = {0}, it is principal and we are done. Let 
us thus take I ^ {0}. Consider the set 

{1'(6), 6 el, &7^0}. 

It is included in the non- negative integers by definition of 5*, thus it contains a 
smallest element, say n. Let 7^ 6 G I such that ^'(6) = n. 

We will now prove that I = (6) . Indeed, take a € I, and compute 

a = bq^ r 

where r = or 'l'(r) < 4'(fe). This yields 

r = a — bq 

and \l/(r) < 5* (6) cannot possibly happen by minimality of n, forcing r to be 
zero. This concludes the proof. □ 

Example 3.13. Consider the ring 

= {a + a, e Z} 

with 

*(a + fe\/d) = -b^d\. 

We will show that we have a Euclidean domain for d = —2, —1, 2. 

Note that U\\fd\ is an integral domain. Take a,/3 7^ in Z[\/d]. Now we 
would like to perform the division of a by /3 to get something of the form 

a ^ Pq + r, q^r ^ 'L[\/d] . 



120 



CHAPTER 3. RING THEORY 



Since Z[-\/d] is not a field, there is no reason for this division to give a result in 
1\\fd\ (that is, q,r £ however, we can compute the division in 'Q(\/d): 

a/fi ^ q', 

with q' ~ X + Vdy with x, y rational. Let us now approximate x, y by integers 
xo,yo, namely take xo,yo such that 

\x-xo\ < 1/2, \y-yo\ < 1/2. 

Take 

q = xo + yoVd, r = I3((x - xq) + (y - yo)Vd), 
where clearly q G Z[\/d], then 

Pq + r = /3{xo + yoVd) + I3{{x - xq) + {y - yo)Vd) 
= I3{x + yVd) = M = a, 

which at the same time shows that r e Z[\/rf]. We arc left to show that '^{r) < 
^{(i). We have 

■^{r) = «'(/3)^'((a; - .To) + (y - yo)yd) 
= vE'(/3)|(x--Xo)2-d(y-yo)'l 
< vI/(/3)[|.T-a:oP + M||y-yoP] 

showing that 'L[\fd] is indeed a Euclidean domain for d = —2, —1, 2. 

Below is a summary of the ring hierarchy (recall that PID and UFD stand 
respectively for principal ideal domain and unique factorization domain): 



integral domains D UFD D PID D Euclidean domains 



Note that though the Euclidean division may sound like an elementary con- 
cept, as soon as the ring we consider is fancier than Z, it becomes quickly 
a difficult problem. We can see that from the fact that being Euclidean is 
stronger than being a principal ideal domain. All the inclusions are strict, since 
one may check that Z[-v/— 3] is an integral domain but is not a UFD, is a 

UFD which is not PID, while Z[(l + zyT9)/2] is a PID which is not a Euchdean 
domain. 
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ring 


ED 


PID 


UFD 


ID 


Z 


yes 


yes 


yes 


yes 


F[X], F a. field 


yes 


yes 


yes 


yes 




yes 


yes 


yes 


yes 




yes 


yes 


yes 


yes 




yes 


yes 


yes 


yes 


Z[{l + iVl9)/2] 


no 


yes 


yes 


yes 


Z[X] 


no 


no 


yes 


yes 


Z[y=3] 


no 


no 


no 


yes 



Table 3.1: Examples of rings we saw: that Z[V3] is a Euclidean domain is 
done in the exercises, that Z[X] is not a principal ideal domain is also shown in 
the exercises, it is enough to show that the ideal (2,X) is not principal. Finally 
Z[^/^] is not a unique factorization domain because we saw that 2 is irreducible 
but not prime. 

3.7 Irreducible polynomials 

Recall the definition of irreducible that we have seen: a non-zero element a 
which is not a unit is said to be irreducible if a = 6c implies that either b or c 
is a unit. Let us focus on the case where the ring is a ring of polynomials R[X] 
and R is an integral domain. 

Definition 3.27. If R is an integral domain, then an irreducible element of 
R[X] is called an irreducible polynomial. 

In the case of a field F, then units of F[X] are non-zero elements of F. 
Then we get the more familiar definition that an irreducible element of F[X] is 
a polynomial of degree at least 1, that cannot be factored into two polynomials 
of lower degree. 

Let us now consider the more general case where R is an integral domain 
(thus not necessarily a field, it may not even be a unique factorization domain). 
To study when polynomials over an integral domain R are irreducible, it is 
often more convenient to place oneselves in a suitable field that contains R, 
since division in R can be problematic. To do so, we will now introduce the 
field of fractions, also called quotient field, of R. Since there is not much more 
difficulty in treating the general case, that is, when i? is a commutative ring, 
we present this construction. 

Let 5 be a subset of R which is closed under multiplication, contains 1 and 
does not contain 0. This definition includes the set of all non-zero elements of 
an integral domain, or the set of all non-zero elements of a commutative ring 
that are not zero divisors. We define the following equivalence relation on i? x 5: 

(a, b) ^ (c, d) <=> s{ad — be) ~ for some s G S*. 

It is clearly refiexive and symmetric. Let us check the transitivity. Suppose that 
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(a, b) ^ (c, d) and (c, d) ^ (e, /). Then 

s{ad — be) ^ and t{cf — de) = 

for some s,t G S. We can now multiply the first equation by i/, the second by 
sb and add them 

stf{ad - be) + tsb{cf - de) = 

to get 

sdt{fa - foe) = 

which proves the transitivity. 

What we are trying to do here is to mimic the way we deal with Z. If we take 
non-zero a,b,e,d G Z, we can write down a/b = c/d, or equivalently ad = be, 
which is also what (a, b) ~ (c, d) satisfies by definition if we take R to be an 
integral domain. In a sense, (a, 6) is some approximation of a/b. 

Formally, if a G i? and 6 G 5, we define the fraction a/6 to be the equivalence 
class of the pair (a, &). The set of all equivalence classes is denoted by S~^R. 
To make it into a ring, we define the following laws in a natural way: 



addition: 



multiplication: 



additive identity: 



additive inverse: 



multiplicative identity: 



a e ad + be 



b d bd 

a e ae 
bd " bd' 





1 " s' 



s G S. 



a —a 
b ^ ~b' 



1 _ s 
1 ^ s' 



s G S. 



To prove that we really obtain a ring, we need to check that all these laws 
are well-defined. 

Theorem 3.18. With the above definitions, the set of equivalence elasses S^^R 
is a eommutative ring. 

1. If R is an integral domain, so is S^^R. 

2. If R is an integral domain, and S = i?\{0}, then S^^R is a field. 
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Proof. Addition is well-defined. If ai/bi = ci/di and 02/62 = C2/d2, then 
for some s,t Cz S, we have 

s{aidi — &1C1) = and t(a2d2 — &2C2) = 0. 

We can now multiply the first equation by t62'^2 and the second by s&idi to get 

tb2d2s(aidi — biCi) = and sbidit{a2d2 — ^222) — 0, 

and adding them yields 

st[d2di{b2ai + bia2) — b2b\{d2Ci + diC2)] — 

that is 

6201 + bia2 _ d2Ci + diC2 
6261 d2di 

which can be rewritten as 

Oi ^ 02 _ Ci ^ C2 

bi 62 di d2 

and we conclude that addition does not depend on the choice of a representative 
in an equivalence class. 

Multiplication is well-defined. We start as before. If ai/bi = ci/di and 
0-2/^2 = C2/d2, then for some s,< G 5, we have 

s{aidi — &1C1) — and t{a2d2 — 62C2) = 0. 

Now we multiply instead the first equation by ta2d2, the second by scibi and 
we add them: 

st[a2d2aidi - C16162C2] = 0. 
This implies, as desired, that 

aia2 _ C1C2 
&162 dlC?2 

To be complete, one should check that the properties of a ring are fulfilled, but 
this follows from the fact that addition and multiplication are carried the usual 
way. 

1. We want to prove that S~^R. is an integral domain. We assume that R 
is an integral domain, and we need to check the definition of an integral 
domain for S~^R. Namely, suppose that {a/b){c/d) = in S~^R, that is 

a c 
bd " 1' 

This means that (ac, bd) ^ (0, 1) and acs = for some s e S. Now acs ~ 
is an equation in R, which is an integral domain, and s ^ 0, thus ac ~ 0, 
so either a or c is 0, and consequently either a/6 or c/d is zero. 
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2. To conclude, we want to prove that S~^R is a field, assuming that R is 
an integral domain, and S = R\{0}. We consider a/6 a non-zero element 
of S^^R, for which we need to find an inverse. Note that a and b are 
non-zero, thus they are both in S meaning that both a/b and b/a are in 
S~^R and b/a is the multiplicative inverse of a/b. 

□ 

Definition 3.28. Let i? be a commutative ring. Based on the above, the set 
of equivalence classes S~^R is a commutative ring, called the ring of fractions 
of R by S. If R is an integral domain, and S = i?\{0}, then S~^R is called the 
field of fractions or quotient field of R. 

Now that we have defined a suitable field, we are left to prove that we can 
embed an integral domain R in its quotient field. 

Proposition 3.19. A commutative ring R can be embedded in its ring of frac- 
tions S~^R, where S is the set of all its non-divisors of zero. In particular, an 
integral domain can be embedded in its quotient field, which is furthermore the 
smallest field containing R. 

Proof. Consider the following map: 

/ : R^ S-^R, f{a) = a/1. 

It is not hard to check that / is a ring homomorphism. If S has no zero divisor, 
we have that the kernel of / is given by the set of a such that /(a) = a/1 = 0/1, 
that is the set of a such that sa = for some s. Since s is not a zero divisor, 
we have a = and / is a monomorphism. □ 

Let us get back to the irreducible polynomials, and consider now the case 
where Z? is a unique factorization domain. It is not necessarily a field, but we 
now know how to embed it in a suitable field, namely its field of fractions, or 
quotient field. Take the polynomial f{X) = a -\- abX, a ^ not a unit. Since 
we can factor it as 

f{X) = a(l + bX) 

where a is not a unit by assumption, this polynomial is not irreducible. But we 
do not really have a factorization into two polynomials of lower degree. What 
happens here is that the constant polynomials are not necessarily units, unlike in 
the case of fields. To distinguish this case, we introduce the notion of primitive 
polynomial. 

Definition 3.29. Let D be a unique factorization domain and let / € 
We call the greatest common divisor of all the coefficients of / the content of 
/, denoted by c(/). A polynomial whose content is a unit is called a primitive 
polynomial. 
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Figure 3.2: Carl Fricdrich Gauss (1777-1855) 



Wc can now rule out the above example, and we will prove later that this 
allows us to say that a primitive polynomial is irredueible if and only if it 
cannot be factored into two polynomials of lower degree. Be careful however 
that "primitive polynomial" has a different meaning if it is defined over a field. 

The next goal is to prove Gauss lemma, which in particular implies that the 
product of two primitive polynomials is a primitive polynomial. 

We start with a lemma. 

Lemma 3.20. Let D be a unique factorization domain, and consider f ^ 
0,g,h e L>[X] such that pf(X) = g(X)h(X) with p a prime. Then either p 
divides all the coefficients of g or p divides all the coefficients of h. 

Before starting the proof, let us notice that this lemma is somehow a gener- 
alization of the notion of prime. Instead of saying that p\ah implies p\a or p\h, we 
have p\g{X)h{X) implies that p\g{X) or p\h{X) (dividing the whole polynomial 
means dividing all of its coefficients) . 

Proof. Denote 

g{X) = .90 +giX + ...+ g^X', h{X) = ho + hiX + ... + htX\ 

Suppose by contradiction that p does not divide all coefficients of g and does 
not divide all coefficients of h either. Then let g^ and h^ be the coefficients of 
minimum index not divisible by p. Then the coefficient of in g{X)h{X) 

is 

gahu+v + gihu+v-i + . . . + guhy + . . . + gu+v-ihi + g-u+vho. 

By definition of u and v, p divides every term but guhy, thus p cannot possibly 
divide the entire expression, and thus there exists a coefficient of g{X)h{X) not 
divisible by p. This contradicts the fact that p\g{X)h{X) . □ 
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Proposition 3.21. (Gauss Lemma). Let f,g be non- constant polynomials 
in D[X] where D is a unique factorization domain. The content of a product of 
polynomials is the product of the contents, namely 

c(/g)=c(/)c(.9), 

up to associates. In particular, the product of two primitive polynomials is 
primitive. 

Proof. Let us start by noticing that by definition of content, we can rewrite 

fiX) = c{f)f*iX), g{X) = c(g)g*{X), 
where f*,g* G D[X] arc primitive. Clearly 

.f9^cif)cig)f*g*. 

Since c{f)c{g) divides fg, it divides every coefBcient of fg and thus their 
greatest common divisor: 

c(/)c(g) I cigf). 

We now prove the converse, namely that c{gf)\ \c{g)c{g). To do that, we 
consider each prime p appearing in the factorization of c{gf) and argue that 
p I c{f)c{g). Let thus p be a prime factor of c{gf). Since fg = c{fg){fg)*, we 
have that c{fg) divides fg, that is 

P I fg- 

By the above lemma, either p \ f oi p \ g, say p \ f = c{f)f* , meaning that 
either p \ c{f) or p \ /*. Since /* is primitive, p cannot possibly divide /*, and 
thus 

p I c{f)=>p I c{f)c{g). 
If p appears with multiplicity, we iterate the reasoning with the same p. □ 

We are now ready to connect irreducibility over a unique factorization do- 
main and irreducibility over the corresponding quotient field or field of fractions. 

Proposition 3.22. Let D he a unique factorization domain with quotient field 
F. If f is a non-constant polynomial in D[X], then f is irreducible over D if 
and only if f is primitive and f is irreducible over F . 

For example, this says that / is irreducible over Z if and only if / is primitive, 
and / is irreducible over Q. 

Proof. First assume that / is irreducible over D. 

f is primitive. Indeed, if / were not primitive, then we could write 

/ = c(/)r. 
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where c(/) denotes the content of / and /* is primitive. Since we assume / is 
not primitive, its content cannot be a unit, which contradicts the irreducibihty 
of / over D, and we conclude that / is primitive. 

/ is irreducible over F. Again assume by contradiction that / is not 
irreducible over F. Now F is a field, thus reducible means / can be factored 
into a product of two non-constant polynomials in F[X] of smaller degree: 

f{X)^g{X)h{X), dcg5<dcg/, dcg/Kdcg/. 

Since g, h arc in and F is the field of fractions of D, we can write 

g{X) = ^g*{X), h{X) = ^^(X), a,b,c,deD 
a 

and g* , h* primitive. Thus 

f{X) = -g*{X)h*{X) 

where g*h* is a primitive polynomial by Gauss Lemma. Since we have already 
proven (in the 1st part) that / is primitive, it must be that ac/bd = u is a unit. 
But this would mean that 

f{X)=ug*iX)h*iX) 

which contradicts the fact that f{X) is irreducible over D[X] and we conclude 
that / is also irreducible over 

We are left to prove the converse. Let then / be a primitive and / be an 
irreducible polynomial over F. We do it by contraction, and assume that the 
primitive polynomial / is not irreducible over D: 

f{X)^g{X)h{X). 

Since / is primitive, degg and deg/i are at least 1. But then neither g not h 
can be a unit in F[X] (these are units in F) and thus 

f^gh 

contradicts the irreducibihty of / over F . □ 

In other words, we have proven that / irreducible over D is equivalent to / 
primitive and cannot be factored into two polynomials of lower degree in -FfX]. 

To conclude, we present a practical criterion to decide whether a polynomial 
in D[X] is irreducible over F. 

Proposition 3.23. (Eisenstein's criterion). Let D he a unique factorization 
domain, with quotient field F and let 

/(X) = a„X" + ... + aiX + ao 

be a polynomial in D[X] with n> 1 and a„ 7^ 0. 

If p is a prime in D and p divides Oi, < i < n but p does not divide a„ 
nor does p^ divide Qq, then f is irreducible over F. 
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Figure 3.3: Ferdinand Eisenstein (1823-1852) 



Proof. We first divide / by its content, to get a primitive polynomial. By 
the above proposition, it is enough to prove that this primitive polynomial is 
irreducible over D. 

Let thus / be a primitive polynomial and assume by contradiction it is 
reducible, that is 

f{X) ^ g{X)hiX) 

with 

giX) = 50 + . . . + .gr^^ HX) ^ho + ... + h,X'. 

Notice that r cannot be zero, for if r ~ 0, then go — g would divide / and 
thus all Qi implying that go divides the content of / and is thus a unit. But this 
would contradict the fact that / is reducible. We may from now on assume that 

r > 1, s > 1. 

Now by hypothesis, p \ ao = (?o^o but does not divide Oq, meaning that p 
cannot divide both go and ho. Let us say that 

P I 90 

and p does not divide ho (and vice- versa) . 

By looking at the dominant coefficient a„ = grhs, we deduce from the as- 
sumption that p does not divide a„ that p cannot possibly divide gr- Let i be 
the smallest integer such that p does not divide g,; . Then 

l<i<r<n = r + s. 
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Let us look at the iih coefficient 

a-i = 9ohi + gih^-i + . . . + 

and by choice of i, p must divide g^, . . . , gi-i- Since p divides by assumption, 
it thus must divide the last term giho, and either p \gi or p \ Hq by definition of 
prime. Both are impossible: we have chosen p dividing neither Hq nor gi . This 
concludes the proof. □ 



The main definitions and results of this chapter are 

• (2.1-2.2). Definitions of: ring, zero divisor, unit, 
integral domain, division ring, subring, characteristic, 
ring homomorphism, ideal, quotient ring. Factor and 
1st Isomorphism Theorem for rings. 

• (2.3-2.4). Operations on ideals, Chinese Remainder 
Theorem, Correspondence Theorem for rings. Defini- 
tions of: principal ideal, maximal ideal, prime ideal, 
the characterization of the two latter in the commu- 
tative case. 

• (2.5). Polynomial Euclidean division, number of 
roots of a polynomial. 

• (2.6). Definitions of: associate, prime, irreducible, 
unique factorization domain, ascending chain condi- 
tion, principal ideal domain, Euclidean domain. Con- 
nections between prime and irreducible. Hierarchy 
among UFD, PID and Euclidean domains. 

• (2.7). Construction of ring of fractions. Definitions 
of: content of a polynomial, primitive polynomial. 
Gauss Lemma, Eisenstein's criterion. 
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Chapter T 

Exercises on Ring Theory 



Exercises marked by (*) are considered difficult. 

4.1 Rings, ideals and homomorphisms 

Exercise 56. Let i? be a ring and x E R. Suppose tliere exists a positive 
integer n sucli ttiat x" = 0. Sliow that 1 + x is a unit, and so is 1 — x. 

Answer. Tlie element 1 — a; is a unit since 

+ + + = 1. 

The element 1 + 2: is a unit since 

(1 + x){l ^x + x^ ~x^ ...± = 1. 

Exercise 57. Let i? be a commutative ring, and / be an ideal of R. Show that 

\/7 := {x G i? I there exists to € N* such that x"" G /} 

is an ideal of R. Answer. 

• Clearly, G y/l. If a G y/l, then a™ G / for some m > 1. Then 
(-a)™ = (-l)'"a™ G /, so -a G /F. Now let a, 6 G 77, so a" G / 
for some n > 1 and b™ G / for some m > 1. Now let us show that 

(a + &)"+'" G /. We have la + 5)"+'" = V -70^6"+™-^' 

■^j!(n + m-j)! 

(because R is commutative). Now if < J < 't^, wc have n + m — j > m, 
so G / in this case (since 6™ G / ^ 6' G / for i > ?7i). If 

n + 1 < j < n + TO, we have j > n + 1, so a-' G / in this case (since 
a" G / a' G / for i > n). Therefore all the terms in the previous sum 
are in / and thus (a + 6)"+™ g /. Hence a + 6 G Vl. We just proved that 
VT is an additive subgroup of R. 
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• Now we have to check the second property. Let a G Vl, and r € R. We 
have G I for some n> 1. Now (ar)" = a"r" because R is commutative, 
so (ar)" g / and therefore ar e \/7- Therefore %// is an ideal of R. 

Exercise 58. Determine all rings of cardinality p and characteristic p. 

Answer. Let i? be a ring of characteristic p. Consider the ring homomorphism: 
if : Z R, the characteristic of R is the natural number p such that pi, is the 
kernel of ip. Wc can now factorizc ip in an injcctive map Z/pZ — !■ R. If now we 
further assume that R has cardinality p, wc have that Z/pZ and R have same 
cardinality, and thus we have an isomorphism. This means that the only ring 
of cardinality and characteristic p is Z/pZ. 

Exercise 59. Let i? be a commutative ring. Let 

Nil{R) = {re R\3n > l,r" = 0}. 

1. Prove that Nil{R) is an ideal of R. 

2. Show that if r e Nil{R), then 1 — r is invertible in R. 

3. Show, with a counter-example, that Nil{R) is not necessarily an ideal 
anymore if R is not commutative. 



1. • Clearly, G Nil{R). If a G Nil{R), then a" = for some m > 1. 

Then (-a)™ = (-l)™a" = 0, so -a e Nil{R). Now let a, 6 G 
Nil{R), so a" = for some n > 1 and 6™ = for some m > 1. 
Now let us show that (a + 6)"+™ = 0. We have (a + 5)"+'" = 

n+m I 

> rra^fe"^"'^-' (because i? is commutative). Now if 

jl{n + m - jy. 

< j < n, we have n + m — j > to, so 6"+™^-' = in this case (since 
6™ = ^ 6* = for J > to) . If n + 1 < J < 71 + TO, we have j > n + 1, 
so = in this case (since a" = => a' = for i > n). Therefore 
all the terms in the previous sum are and thus (a + 6)"+™ = 0. 
Hence a + 6 G Nil{R). Wc just proved that Nil{R) is an additive 
subgroup of R. 

• Now we have to check the second property. Let a G Nil{R), and 
r E R. We have a" = for some n > 1. Now (ar)" = a"r" because 
R is commutative, so (ar)" = and therefore ar G Nil{R). Therefore 
Nil{R) is an ideal of i?. 

2. If r G Nil{R), then r™ = for some to > 1. Then 1 + r + r^ H h r™-^ 

is the inverse of 1 — r since 

(l-r)(l+r+r2+. • •+r"-i) = l+r+r^-+- ■ ■+r'"-^-r-r^+- ■ ■+r" = 1-r™ = 1. 
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3. If i? = AhiC), let a = ^ Q Q ^ and 6 = ^ ^ ^ ^ Then ^ ^ 

Q Q ^ , SO a, & e Nil{R), but a + 6 docs not lie in Nil{R), since (a + 
= /2, and I^ = h for all n>l. 

Exercise 60. Determine whether the following maps are ring homomorphisms: 

1. /i : Z — >Z with fi{x) = x + 1. 

2. /s : Z — ^ Z with /2(x) = x"^ . 

3. /s : Z/15Z — > Z/15Z with /3(.t) = 4a;. 

4. /4 : Z/15Z — > Z/15Z with fi{x) = &x. 
Answer. 

1. Since /i(0) = 1, /i, / cannot be a ring homomorphism. 

2. Since /2(.t + 2/) = + + 2xy ^ x'^ + y'^ ^ f2{x) + /2(?/), /2 cannot be 
a ring homomorphism. 

3. Since fz{xy) ~ Axy ^ xy ^ f3{^)f3{y)j fa cannot be a ring homomor- 
phism. 

4. Since fi{l) ^ 1, fi cannot be a ring homomorphism! 
Exercise 61. Let K be a division ring with center k. 

1. Show that the center of the polynomial ring K[X] is k[X]. 

2. For any a in K\k, show that the ideal generated by X — a in K[X] is in 
fact the whole ring 

3. Show that any ideal / C K[X] has the form K[X]h where h £ k[X]. 
Answer. 

1. Clearly k[X] is in the center. Conversely, if / = ^a^X' is in the center, 
then fa = af for all a € K, showing that a; S k. 

2. Fix b E K such that ab ^ ba. Then the ideal generated by X — a contains 

b{X -a) - {X -a)b = ab-baeK 
since ab ^ ba so {X — a) = R. 

3. We may assume 1^0 and fix a nionic polynomial of the least degree in 
/. By the usual Euclidean algorithm argument, we have that / = K[X]h. 
For any a E K, we have ha E I ~ K[X]h so ha = rh for some r in if [X]. 
By comparing the leading terms, we see that r E K and in fact r = a. 
Thus ha = ah for any a € K, which means that h G k[X]. 
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Exercise 62. Consider the ring A^„(]R) of real n x n matrices. Are the trace 
and the determinant ring homomorphisms? 

Answer. The trace is not multiplicative, since 
The determinant is not additive: 
Thus none of them are ring homomorphisms. 



4.2 Quotient rings 

Exercise 63. Compute the characteristic of the following rings R: 

1. R = Z„ = Z/nZ, 

2. i? = Z/2Z X Z/4Z X Z/IOZ, 

3. i? = Z[j]/(2 — where j denotes a primitive 3rd root of unity {j^ = 1 
butjVl). 

Answer. In this exercise, we use the notation x to denote an element in the 
quotient group involved. 

1. For 1 < m < n — I, wc have m • 1 = to 7^ 0, since to is not a multiple of 
n. But n ■ I = n = 0. So char(i?) = n by definition of the characteristic. 

2. If TO S Z, we will denote by respectively by to, [to], to its class modulo 2, 4 
and 10. Assume that to(T, [1], 1) = (0, [0], 0). Then we have 

(to, h» = (0, [0],0), 

which implies that to is a multiple of 2,4 and 10. Hence to is a multiple 
of the lowest common multiple of 2,4 and 10, which is 20. Conversely, 
20(T, [1], i) = (20, [20], 20) = (0, [0], 0). Therefore char(i?) = 20. 

3. Here we have (2 - 5j) (2 - 5j^) = 4 - 10(j + j^) + 25 f = 4+10 + 25 = 39. 
Hence 39 • T = 39 = (2 - 5j) • (2 - 5f ) = 0. Then the characteristic of 
R is finite and divides 39. Therefore the characteristic of i? is 1,3, 13 or 
39. Now let c = char(i?) > 0. Since c • Ijf lies in the ideal (2 — 5j), then 
c = (2 — 5j)(a + bj) for some a, b, e Z. Hence |c|^ = |2 — 5j|^|a + so 

c2 = 39(0^ + 6^ - ab) 

and therefore 39|c^. The only value (among 1,3,13 and 39) for which it 
is possible is c = 39. Thus char(_R) = 39. 
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Exercise 64. Prove the following isomorphisms: 

1. + ~ Z/2Z. 

2. Z[X]/(n,X) ~ Z/riZ, n > 2. 

3. Z[X]/(n) ~ (Z/nZ)[X], n > 2. 
Answer. 

1. Consider (^:mGZn-m-l/} = mG Z[z]/(1 + i). This is a ring 
homomorphism. It is smjcctivc. Indeed, let a + bi G Z[i]/(1 + i). We 
have a + bi ~ (6 — a) + a(l + i) = b ^ a, so a + hi = ip{b — a). Now 
ker(iy9) — c ■ where c = char(i?) by definition of the characteristic. By 
direct computation, we get char(i?) = 2 (since R is not the trivial ring and 
(l + i)(l — i) = 2). Therefore ker((^) = 2Z. Now use the first isomorphism 
theorem. 

2. Let us consider ip : P E Z[X] i-^ P{0) G Z/nZ. This is the composition 
of the ring homomorphisms P E Z[X] i-> P(0) G Z and in E Z i-^ fn E 
Z/nZ, so it is a ring homomorphism. It is surjective: for m e Z/rtZ, we 
have (p{m) = to, where to G Z C Z[X] is considered as a constant poly- 
nomial. Now we have ker(iy9) = {P E Z[X]\P{0) is divisible by n}, which 
equals {n,X). Hence ker((^) = {n,X); now applying the first isomorphism 
theorem, we get the result. 

3. Consider the reduction modulo n, ip : P E Z[X] ^ P E {Z /nZ)[X]. We 
have that ip is a ring homomorphism. It is surjective: let / e (Z /nZ)[X], 

/ = ao H + aniX"^, ai E Z. Then let P = oq H h a^X™ E Z[X]. By 

definition of P, we have '/'(-P) = /• Now let us compute the kernel of ip. 

Let P = ao H h a„X™. We have p{P) = oq H hOmX™ = 0. 

This is equivalent to say that = for all i, which means that n\ai for 
all i. This is equivalent to say that P = n • Q, for some Q E Z[X]. Hence 
ker(iy9) = (ti). Now apply the first isomorphism theorem. 

Exercise 65. Let A = C[X;(7] be the ring of all skew polynomials ^a,;X*, 
ai E C, where multiplication is defined by Xa = a{a)X for all a E C, and a is 
the complex conjugation on C. 

• Show that the center Z{A) of A is Z{A) = R[X'^]. 

• Show that A = A/A{X^ + 1) is a ring. 

• Show that A is isomorphic to H, the division ring of Hamilton quaternions. 
Answer. 

• Note that X^a = Xa{a)X = (T^(a)X^ and more generally 
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Now if ^ bjX^ is in the center, then we must have 

i j 

thus X^ must be an even power of X so that when anti-commute with 
X^ , (ai) ~ ai since a is of order 2. Furthermore, we must have that 
cr''{bj) ~ bj for any i, showing that bj must be real, which shows that 
the center is (More formaUy, one can take a polynomial in the 

center, say p{X), and compute p{X)a — ap{X) for any a G C, which 
shows that p{X) £ C[X'^], then compute p{X)X = Xp{X) which shows 
that piX) e R[X^]). 

• For this quotient to be a ring, we need the ideal A{X'^ + 1) to be two-sided. 
This is the case since X^ + 1 belongs to the center by the point above. 

• We can express the ring of Hamilton quaternions H in the form H = C®Cj, 
and define 

ip: A^m, ip{X) ^ j, ip{a) = a, a e C. 

Since ja — cr{a)j in H for any a £ C, ip gives a ring homomorphism 
from A to H. This induces a ring homomorphism ip : A ^ M since 
ip{X^ + 1) = f + 1 = 0. Since 

(p(a + bX) = a + bj, 

Cp is an isomorphism. (This is the first isomorphism theorem for rings.) 

4.3 The Chinese Remainder Theorem 

Exercise 66. Show that the following rings arc isomorphic: 

Z/72Z X Z/84Z ~ Z/36Z x 168Z. 

Answer. We have that 72 = 8 • 9 and gcd(8,9) = 1, thus Z72 ~ Zg x Zg. 
Similarly Z84 ~ Z4 x Z3 x Z7, Z36 ~ Z4 x Zg and Zigs ~ Zg x Z3 x Z7. Thus 

Z72 X Z84 ~ Zg X Zg X Z4 X Z3 X Z7 

~ Zg X Z36 X Z3 X Z7 

— ^36 X ^128- 

Exercise 67. Show that 10^^ + 1 is a multiple of 247. 
Answer. We have that 

100= 12-8 + 4 

thus 

10^°" = (10^2)8 . ;^q4 ^ ^q4 ^ ^_3)4 = 3 = _io mod 13 
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where the second equaHty uses that ^ = 1 mod p. Similarly (100 = 18-6 — 8) 

10^°" = 10"^ = 2* = 9 = -10 mod 19. 
By the Chinese Theorem, we deduce that 

10^"° = -10 mod 247. 
Since gcd(l 0,247) = 1, we can simphfy by a factor of 10, and get 

10^^ = -1 mod 247 

and thus 247 | 10^^ + 1. 

Exercise 68. The battle of Hasting (October 14, 1066). "The men of Harold 
stood well, together, as their wont was, and formed thirteen squares, with a 
like number of men in every square thereof, and woe to the hardy Norman who 
ventured to enter thier redoubts; for a single blow of a saxon warhatched would 
break his lance and cut through his coat of mail... When Harold threw himself 
into the fray the Saxon were one mighty square of men, shouting the battle-cries 
'Ut!', 'Olicross!', 'Godemite!'." 

How many men were there in the army of Harald Hardrada? (This exercise 
is courtesy of C. Wuthrich). 

Answer. The men of Harald formed thirteen squares, that is 13x^, when Harold 
threw himself into the battle (+1), they were one mighty square of men (y^). 
This gives the equation 

^ 13x^ + 1. 

We then have to look for the smallest integer solution. Using field theory instead, 
one can rewrite this equation as 

1 = iy-VT3x){y + Vl3). 

We are thus looking for an element y + \/lSx of K = (!2(\/l3) which satisfies 
this equation. One can show that r] ~ ^+'^^ satisfies this equation up to a sign 
— 1, thus T] with an even power satisfies it, and rj and its powers are actually the 
only elements in K to satisfy it. We thus need to take an even power of 77 which 
will give us an element in the ring Z[Vl3]. We find that 77^ = 649 + 180A/r3 is 
the first power to satisfy this condition. Finally, the smallest integer solution to 
the equation y^ ~ ISx^ + 1 is a: = 180 and y =■ 649, that is, there were 42T200 
men with Harald Hardrada. It is however known that his army was instead 
containing about 7'500 men. 

4.4 Maximal and prime ideals 

Exercise 69. Show that a non-zero principal ideal is prime if and only if it is 
generated by a prime element. 
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Answer. If p is prime then consider the principal ideal pR = {pr, r <E R}. To 
show that pR is prime, we have to show that if ab G pR then either a or 6 is in 
pR. If ab £ pi?, then ab = pr for some r (z R. Since p is prime, it has to divide 
either a or b, that is either a = pa' or 6 = pb' . Conversely, take a principal ideal 
cR which is prime, thus if ab e cR, cither a G cR, that is a = ca' , or & G cR, 
that is 5 = cb' . We have thus shown that if c\ab, then c\a or c\b. 

Exercise 70. Are the ideals {X,X + 1), {5,X'^ + 4) and (A^ + 1,A + 2) 
prime/maximal in Z[A]? 

Answer. 

• I = (A, A + 1) = Z since 1 = (A + 1) — A, thus / is not a proper ideal 
and cannot be prime. 

• Consider Z[A]/(5,A2+4) ~ Z5[A]/(A2+4), and (A2+4) = (A-1)(A+1) 
is reducible modulo 5, thus this quotient is not an integral domain and 
thus the ideal is not prime. 

• / = (A2 + 1, A + 2) = (A + 2, 5) since (A + 2f - 4(A + 2) + 5 = A^ + 1, 
then Z[A]// ~ Z5[A]/(A + 2) where A + 2 is irreducible in Z5[A] thus 
the quotient is a field and / is maximal. 

Exercise 71. 1. Consider the ring R = Z[i] and the ideal / = (1 + 1) in R. 
Is / prime? Is / maximal? 

2. Consider the ring R = Z[j'] and the ideal J = (2 — rj) in R. Is / prime? 
Is / maximal? (j is a primitive 3rd root of unity.) 

3. Consider the ring R = Z[A] and the ideal / = (n) in R. Is / prime? Is / 
maximal? 

Answer. 

1. We have Z[z]/(l + i) ~ Z/2Z, which is a field, so (l + «) is maximal (hence 
prime). 

2. The characteristic of Z[j]/(2 — 5j) is 39 which is not a prime number (see 
Exercise 63), so Z[j]/ (2 — 5j) is not an integral domain. Hence (2 — 5j) is 
not prime and therefore not maximal. 

3. We have Z[A]/(?i) ~ Z/7iZ[A]. We have that Z/nZ[A] is an integral 
domain if and only if Z/nZ is an integral domain. Hence (n) is a prime 
ideal if and only if n is a prime number. It is never maximal since Z /n Z[A] 
is not a field for any n (A has no inverse). 

Exercise 72. Consider the ring R = K[X] and the ideal of R given by / = 
(A — a), where K is a field, and a G K. Is / maximal? Is / prime? 

Answer. Let (p : P G A' [A] M' P{a) E K. This is a ring homomorphism, 
which is surjective: indeed, if A e A', then (^(A) = A, where A G A' C A[A] 
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is viewed as a constant polynomial. We now determine the kernel of (f. Let 
P e K[X]. We can write P = QiX).{X - a) + c, for some Q e K[X] and 
c e K. (Indeed, it suffices to proceed to the division of P hy X — a. The 
remainder is either zero or has degree < 1, that is degree 0, which means that the 
remainder is a constant.) Then we have P{a) = Q(a).{a — a) + c = c. Therefore, 
(p{P) = ■^=> c = <;==> P is a multiple oi X — a. Hence kei{(p) = {X — a) 
(the principal ideal generated hy X — a). Using the first isomorphism theorem, 
we get that K[X]/{X -a) 2± K. Since K[X]/{X -a) 2:^ and if is a field, 
then K[X]/{X — a) is a field as well and {X — a) is maximal (hence prime). 

Exercise 73. Let i? be a commutative ring. Let 

Nil{R) = {r e R\3n > l,r" = 0}. 

1. Show that Nil{R) is contained in the intersection of all prime ideals of R. 

2. Show that Nil{R/Nil{R)) = 0. 
Answer. 

1. Let a e Nil{R), so a" = for some ?i > 1. Assume that there is a prime 
ideal p for which a ^ p. We have a" = G p. Since a" = a"^^.a and p is a 
prime ideal, then a"^^ G p or a G p. By assumption on a, we have a ^ p, 
so necessarily a"^^ G p. But a"^^ ~ a^^^.a G p, so a"~^ G p for the same 
reasons, and by induction we get a G p, a contradiction. Therefore a lies 
in all the prime ideals of R. 

2. Let a G Nil{{R/Nil{R))), so a" = for some n > 1. Then ^ = 0, which 
means that a" G Nil{R) by definition of the quotient ring. Therefore, 
there exists m > 1 such that (a")™ = 0, so a""* = 0, which means that 
a G Nil{R). Hence a = 0. 

Exercise 74. Let R = 1[X], and let n > 1. 

• Show that the ideal {n^X) is given by 

{n,X) = {p{X) G p(0) is a muhiple of n). 

• Show that (n, X) is a prime ideal if and only if n is a prime number. 
Answer. 

• Let P G (n,X), so P ^ n.Qi + X.Qi for some Qi.Q^ G 1\X\. Then 
P(0) = 7i.(3i(0) G ?iZ (we have Qi(0) G Z since Qi G Z[X]), that is 
P(0) is a multiple of n. Conversely, assume that P G Z[X] is such that 
P(0) is a multiple of n, and write P = a„X" + • • • + oiX + oq. Then 
P(0) = ao, so by assumption oo = n.m for some m G Z. Now we get 
P = n.TO + X(a„X"-i H h + ai), so P G [n,X). 



140 



CHAPTER 4. EXERCISES ON RING THEORY 



• If n is not a prime number, then we can write n = ni.n2, 1 < ni, n2 < n. 
Now consider Pi = ni,P2 = n2 € (constant polynomials). We have 

P1.P2 = ni.n2 = n E {n,X), but Pi and P2 are not elements of {n,X). 
Indeed, ^"1(0) = ni and ^2(0) = 712, but ni,n2 are not multiples of n 
by definition. Hence (n, X) is not a prime ideal. Now assume that n is 
equal to a prime number p. First of all, {p, X) 7^ because 1 ^ (p, X) 

for example. Now let Pi,P2 G such that P1.P2 € Then 

(Pi.P2)(0) is a multiple of p by the previous point, that is p|Pi(0).P2(0). 
Since p is a prime number, it means that p|Pi(0) or p|P2(0), that is Pi S 
{p,X) or P2 G {p,X). Hence {p,X) is a prime ideal. 

4.5 Polynomial rings 

Exercise 75. Set 

P = e I p(0) is even }, P = {(7(X) G Z[X] | q{0) = 0(mod 3)}. 

Check that E and P arc ideals of Z[X] and compute the ideal E + F. Further- 
more, check that P • P C {p{X) G Z[X]\p{0) = (mod 6) }. 

Answer. If p(X) = ELo^fe^''' t^cn 

P = {p(X) e Z[X] I po e 2Z} and P = {g(X) e Z[X] | go e 3Z}. 

Thus P and P are ideals of Z[X] since 2Z and 3Z are ideals of Z. If X^fe "^fe^*^ = 
{Y.kPk^'') ■ (Lfc*^''): then Co = po^o and thus 

p • p c {p{x) e Z[X] I po e 2Z • 3Z} {p(x) e z[x] \ po e 6Z}. 

Similarly, 

P + P = {p(x) e Z[X] I Po e 2Z + 3Z} = {p(x) e z[x] I Po e z} = z[x]. 



Exercise 76. Show that if P is a field, the units in F[X] are exactly the nonzero 
elements of P. 

Answer. Let f{X) G F[X] of degree ri, /(X) is a unit if and only if there exists 
another polynomial g(X) E F[X] of degree m such that f{X)g{X) = 1. Because 
P is a field (thus in particular an integral domain), f[X)g{X) is a polynomial 
of degree n + m, thus for the equality to hold, since 1 is a polynomial of degree 
0, we need 71 + m = 0, thus both / and g are constant, satisfying fg = 1, that 
is they are units of P, that is nonzero elements since P is a field. 

Exercise 77. There exists a polynomial of degree 2 over Z/4Z which has 4 
roots. True or false? Justify your answer. 

Answer. Take the polynomial 2X{X — 1). 
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Exercise 78. Let i? be a ring, and let a G R such that there exists an 
integer n with a" = 0. Show that R* C {R[X])* and R* ^ R[X]* , where R* 
and denote respectively the group of units of R and R[X]. 

Answer. Clearly R* C Wc need to show tliat the inclusion is 

strict, that this, there exists an element in R[X]* which is not in R* . Take 
f{X) = l- aX. We have 

(1 - aX){l + aX + {aXy + ... + iaX)"-^) = 1, 

and / docs not belong to R* . 

Exercise 79. Let be a field. Consider the ring /ir[X, y] of polynomials in 
indeterminates X and Y with coefficients in K. 

1. Is K[X,Y] an integral domain? 

2. What are the units of K[X, Y]7 

3. Consider the ideals Ii = {X) and X2 = {X,Y). Are they prime ideals of 
K[X, Y]7 

4. Show that J = {f e K[X, Y], /(O, 0) = 0} is an ideal. 

5. Deduce using J' that K[X, Y] cannot be a principal ideal domain. 
Answer. 

1. Yes it is. It is a commutative ring (since K is a field). Furthermore, it has 
no zero divisor, since K as none. 

2. So units of /^[X, F] are polynomials / G F] such that there exist 
g £ K[X,Y] with fg ~ 1. Thus the degree of the polynomial fg is 0, 
and both f,g must be constant polynomials (since if is a field). Thus the 
units are those of K. 

3. Both of them are for the same reason: K[X, Y]/Ii ~ K[Y] and K[X, Y]/l2 ~ 
K, both of them are integral domains, thus both ideals are prime. 

4. Take f,g e J, then f — g belongs to J, and if h is in K[X, Y], we also 
have that hf G J'. 

5. Assume there exists / e A'[A, y] such that (/) = J'. Note that both 
X and Y belong to J^. Thus there must exist g,h £ K[X,Y] such that 
X = f{X,Y)g{X,Y) and Y = f{X,Y)h{X,Y). Since X is of degree 1, 
and Y is of degree 1, we should have f{X, Y) = aX+bY. But now, if a ^ 0, 
Y = h{X, Y){aX + bY) is not possible, and if 7^ 0, A = /(A, Y)g{X, Y) 
is not possible cither. 
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4.6 Unique factorization and Euclidean division 

Exercise 80. 

Show that the ideal generated by 2 and X in the ring of polynomials Z[X] is 
not principal. 

Answer. We have that 

(2, X) = {2r(X) + Xs{X), riX), s{X) € Z[X]}, 

and assume there exists f{X) E U\X\ such that {2,X) = {f{X)). Since 2 G 
(/(X)), then ,f{X) = ±2. Since X G (/(X)), we should have X = ±25(X), a 
contradiction. 

Exercise 81. Let R be an integral domain in which every decreasing chain of 
ideals is finite. Show that i? is a field. 

Answer. Let x € R, x ^ 0. Then (x) D (x^) D (x^) D ... is a decreasing 
chain of ideals. It thus stabilizes at some point by assumption, that is, there is 
a A: in N such that [x^) — In particular, there is an element a E R such 

that ax^^^ = x^ . Since R is an integral domain, we have ax = 1, and thus x is 
invertible, showing that R without the element is a field. 

Exercise 82. Show that if i? is a unique factorization domain, then R[X] is 
also a unique factorization domain. 

Answer. Let us write f{X) = oq + aiX + a2X'^ + • • • + a„X", aj £ R. Recall 
that c(/) is the content of / defined as gcd(ao, . . . , a„). We need to check that 
a factorization exists, and that it is unique. 

• Existence: if p e i? is irreducible, then p is also irreducible in If 
f{X) G R[X], we can write f{X) = df{X) be factoring the content d, 
so that c(/) = 1. We can factor d into a product of irreducible in R. 
Now either / is irreducible in or it factors properly into a product 
of lower degree polynomials (c(/) = 1). All the factors will also have 
content 1, and we can only lower degree of factors finitely often, so wc get 
a factorization of /, and thus one for / as product of irreducibles in 

• Uniqueness: by Theor 2.15, it suffices to prove that each irreducible el- 
ement is prime, which we can do by proving that each irreducible el- 
ement generates a prime ideal in R[X]/ If p S i? is irreducible, then 

R[X]/pR[X] = {R/p)[X] which is an integral domain. 

Exercise 83. Let be a field, let f{X),g{X) G F[X] , and let d{X) be a 
greatest common divisor of f{X) and g{X). Show that there arc polynomials 
u{X),v{X) € F[X] such that 



d{X)=u{X)f{X)+v{X)g{X). 
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When does Bezout identity hold more generally? 

Answer. Bezout identity works for general PID as follows (and thus in partic- 
ular for Take a,b G R, where i? is a PID. Consider the corresponding 
principal ideals aR and bR, we have that 

aR + bR = cR 

simply because R is a PID. Since aR C ci?, c\a and for the same reason c\b. 
Now consider d = gcd(a, b), then d\a and d\b, and thus conversely dR contains 
aR and bR and thus cR, showing that d\c. But c must also divide d, showing 
that c = d, that is 

aR + bR = gcd(a, b)R, 

in words, gcd(a, b) is some linear combination of a and b using coefficients in R. 
This does not work for arbitrary UFDs. For example, in the polynomials 

X and 2 arc coprimc, but no linear combination of 2 and X gives 1. For more 
generalization of this notion, check the definition of Bezout domain. 

Exercise 84. Show that Z[-\/3] is a Euclidean domain. (Hint: use the same 
technique as the one seen for Z[-\/2].) 
Answer. Consider the ring 

Z[^/3] = {a + 6^/3, a, 6 e Z} 

with 

*(a + 6\/3) = \a^-ib\ 
Take a, (3 ^ in Z[\/3], and compute the division in Q(V3): 

with q' = X + a/Sj/ with x, y rational. Let us now approximate x, y by integers 
xo,yo, namely take xo,2/o such that 

\x-xo\ < 1/2, \y-yo\ < 1/2. 

Take 

q = xo + yoVs, r = l3{{x - xq) + {y - yo)V3), 
where clearly q G Z[-\/3], then 

/3q + r = l3{xo + yoVs) + I3{{x - xq) + {y - yo)V3) 
= Pix + yVS) = Pq' = a, 

which at the same time shows that r £ Z[y/3]. So far this is exactly what we 
did in the lecture. We are also left to show that '^{t) < ^(/3). We have 

M'(r) = ^ip)^iix~xo) + iy-yo)Vd) 
= vI/(/3)|(a;-xo)2-d(y-2/o)'| 
< ^{mx~xo\' + \d\\y-yo\'] 
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though here we notice that we get | + 13| ^ = 1. So this is not good enough! But 
let us see what this means to get 1: this happens only if |a; — xqP = |y — = 
1/4, otherwise we do get something smaller than 1. Now if |a; — xqP = jy — = 
1/4, we have from the second equation that 

vf = ^m{x - xof - d{y yofl = - J| < 1 

and we are done. 

Exercise 85. The goal of this exercise is to show that a principal ideal domain 
is a unique factorization domain in which every prime ideal is maximal. (Hint: 
To show that every prime is maximal, take a prime ideal I and a maximal ideal 
Ai, and sec what it means for I to be included in Ai in a PID). Note that the 
converse is true. 

Answer. If we have a PID, it is a UFD (this is far from obvious, this was shown 
in the notes). We have to show that every prime ideal is maximal. Take X a 
prime ideal, and M a maximal ideal. Thus X C by maximality of M. Now 
since we have a PID, we can write I = (a), M. = (m) and (a) C (m) showing 
that m\a. Thus a = md for some d. But now a is prime (this follows from 
(a) being prime, see Exercise 69) thus it is irreducible (in a UFD, irreducible 
and prime are equivalent). Since a is irreducible, either m or d is a unit, and 
m cannot be (otherwise would be i?, which is impossible by definition of 
maximal ideal), thus d is a unit. Then a and m are associate, so they generate 
the same principal ideal, and X = M.. 

4.7 Irreducible polynomials 

Exercise 86. Prove whether the following polynomials are reducible/irreducible 
over F . 

1. -2, F = Q. 

2. lt^ + lt'^ + t^ + l,F = Q. 

3. + 15i"^ + 7, F = Z, hint: think of modulo. 

4. t'^'^ + t'^'^ + t^-^ + ... + t'^ +t^ +t+l, F ^ Q, hint: this needs a trick. 
Answer. 

1. Use Eisenstein's criterion with p ~ 2. 

2. This polynomial is irreducible if and only if 

9 fit) = 2t^ + 15t** + 9t^ + 3 

is irreducible over Q. Here Eisenstein's criterion can be applied withp = 3, 
showing that / is irreducible. 
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3. Modulo 5, f{t) = t'^ + 2. If this is reducible, then either it has a a factor 
of degree 1 (not possible, it is easy to try the 5 values), or it is a product 
of two factors of degree 2. The latter can be checked explicitly: if 

t'^ + 2 = {t^ + at + b){f + ct + d) 

then a + c = 0, ac + & + d = 0, hd — 2. One can check all possible values 
and see that this is not possible either. Hence + 2 is irreducible modulo 
5, and therefore the original polynomial was irreducible over Z. 

4. Notice that f{t) is irreducible if and only if f{t + 1) is. By expanding 
f{t + 1), one can use Eisenstein's criterion with q = 17. 

Exercise 87. True/False. 

Ql. Let i? be a ring, and let r be an element of R. If r is not a zero divisor of 
R, then r is a unit. 

Q2. A principal ideal domain is a euclidean domain. 

Q3. Hamilton's quaternions form a skew field. 

Q4. The quotient ring Z[t]/(1 + i)Z[i\ is a field. 

Q5. A field is a unique factorization domain. 

Q6. The ideal (5, i) in is principal. 

Q7. The polynomial Sx^ + 15X^ + 10 is irreducible over Q. 

Q8. Let i? be a ring, and M be a maximal ideal, then R/M is an integral 
domain. 

Answer. 

Ql. This cannot be true in general! Take Z for example. It has no zero divisor, 
but apart 1 and -1, no other element is a unit! Actually, in an integral 
domain, there is no zero divisor, which does not mean it is an field. 

Q2. A euclidean domain is a principal ideal domain. The converse is not true. 
Take for example Z[(l + iy/l9)/2]. It is a principal ideal domain, but it is 
not a euclidean domain. 

Q3. A skew field is non-commutative field. Hamilton's quaternions are non- 
commutative, and we have seen that every non-zero quaternion is invert- 
ible (the inverse of q is its conjugate divided by its norm). 

Q4. It is actually a field. You can actually compute the quotient ring explicitly, 
this shows that Z[i]/(1 -I- i)'E[i] is isomorphic to the field of 2 elements 
{0, Ij.This can be done using the first isomorphism for rings. 

Q5. It is true since every non-zero element is a unit by definition. 
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Q6. It is true! With no computation, we know it from the theory: We know 
that Z[z] is a euclidean domain, and thus it is a principal domain, so all 
ideals including this one are principal. 

Q7. It is true! Use for example Eisenstein's criterion with p = 5. 

Q8. Who said the ring R is commutative? The statement seen in the class is 
about commutative rings. It is not true for non-commutative rings. Here 
is an example: take R ~ Z + Zi + Zj + Zfc (ring of quaternions with 
integer coefficients), pR is a maximal ideal of R (p odd prime) but R/pR 
is actually isomorphic to M2{'Z/p1j) and thus is not an integral domain. 
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Abstract field theory emerged from three theories, which we would now call 
Galois theory, algebraic number theory and algebraic geometry. 

Field theoretic notions appeared, even though still implicitly, in the modern 
theory of solvability of polynomial equations, as introduced by Abel and Galois 
in the early nineteenth century. Galois had a good insight into fields obtained 
by adjoining roots of polynomials, and he proved what we call now the Primitive 
Element Theorem. 

Independently, Dedekind and Kronecker came up with the notion of alge- 
braic number fields, arising from three major number -theoretic problems: Fer- 
mat's Last Theorem, reciprocity laws and representation of integers by binary 
quadratic forms. 

Algebraic geometry is the study of algebraic curves and their generalizations 
to higher dimensions, namely, algebraic varieties. Dedekind and Weber carried 
over to algebraic functions the ideas which Dedekind had earlier introduced for 
algebraic numbers, that is, define an algebraic function field as a finite extension 
of the field of rational functions. 

At the end of the nineteenth century, abstraction and axiomatics started to 
take place. Cantor (1883) defined the real numbers as equivalence classes of 
Cauchy sequences,von Dyck (1882) gave an abstract definition of group (about 
thirty years after Cayley had defined a finite group). Weber's definition of a 
field appeared in 1893, for which he gave number fields and function fields as 
examples. In 1899, Hensel initiated a study of p-adic numbers, taking as starting 
point the analogy between function fields and number fields. It is the work of 
Steinitz in 1910 that initiated the abstract study of fields as an independent 
subject. A few examples of his results are: classification of fields into those 
of characteristic zero and those of characteristic p, development of the theory 
of transcendental extensions, recognition that it is precisely the finite, normal, 
separable extensions to which Galois theory applies, proof of the existence of 
the algebraic closure of any field. 
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Major developments in field theory and related areas that followed Steinitz's 
work include valuation theory, class field theory, infinite Galois theory and finite 
fields. 

5.1 Field extension and minimal polynomial 

Definition 5.1. If F and E arc fields, and F C E, wc say that E is an extension 
of F, and we write cither F < E oi E/F. 

Examples 5.1. Here are some classical examples: 

1. C = {a + hi, a, 6 e M} is a field extension of R. 

2. Q(\/2) = {a + bV2, a, 6 e Q} is a field extension of Q. 

3. Q{i) = {a + bi, a,b E Q} is a field extension of Q. 

If E is an extension of F, then in particular E is an abelian group under 
addition, and we may multiply x £ E hy X E F. We can see that this endows E 
with a structure of F-vector space (the elements of E are seen as vectors, those 
of F as scalars). It then makes sense to speak of the dimension of E over F. 

Definition 5.2. Let E/F he a. field extension. The dimension of E as F- vector 
space is called the degree of the extension, written [E : F]. If [F : F] < oo, we 
say that F is a finite extension of F, or that the extension E/F is finite. 

Let us get back to our examples: 

Examples 5.2. 1. Consider the field extension C/M. We have that C is a 
vector space of dimension 2 over M. It is thus an extension of degree 2 
(with basis {1, i}). 

2. The field extension Q(a^/(2))/Q is of degree 2, it is called a quadratic 
extension of Q. 

3. The field extension Q{i)/Q is a also a quadratic field extension of Q. 

4. Both Q(VT2))/Q and Q(i)/Q are finite field extensions of Q. Finite ex- 
tensions of Q are called number fields. 

If wc look at C, wc sec it is obtained by adding i to R, and i is a root of 
the polynomial + 1. Similarly, Q(\/2)/Q is obtained by adding a root of 
the polynomial — 2. In what follows, wc will make formal the connection 
between roots of polynomials and field extensions. 

Before we start, recall that if we have two fields E,F and a field homomor- 
phism between them (that is, a ring homomorphism between two fields), then 
/ is a monomorphism. We have seen the argument in the previous chapter 
already: the kernel of a ring homomorphism is an ideal, and a field has only 
trivial ideals, namely {0} and itself, and it cannot be that the whole field is the 
kernel. 
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Theorem 5.1. Let f be a non- constant polynomial over a field F. Then there 
is an extension E/F and an element a E such that f{a) = 0. 

Proof. Recall that F[X] is a unique factorization domain, thus / can be factored 
into a product of irreducible polynomials, and we may assume without loss of 
generality that / is itself irreducible. Consider now the ideal 

in F[X], the ring of polynomials with indeterminate X and coefficients in F. 
Again using that F[X] is a unique factorization domain, we have that f{X) is 
irreducible and equivalently prime, implying that {f{X)) is prime. Now F[X] is 
furthermore a principal ideal domain. This means that I = {f{X)) is contained 
in a principal maximal ideal so that q{X) divides the prime f{X). Since 

f{X) = q{X)g{X) for some g{X), and q{X) cannot be a unit because f(X) is 
irreducible, f{X) and q{X) are associates, and {f{X)) = {q{X)), proving that 
{p(X)) = I is maximal. Thus by the characterization of maximal ideals with 
respect to their quotient ring, we have that 

E = F[X]/I 

is a field. Wc now place an isomorphic copy of F inside E via the monomorphism 

h : F E, at-^ a + 1. 
This thus gives a field extension E/F. Now let 

a ^ X +1 e E. 

We are left to prove that a is a root of f{X). If f{X) — ao + aiX + . . . + anX^, 
then 

/(a) = {ao+I) + ai{X +!) + ... + a„{X +1)" 

= ao+I + aiX + a{I + . . . + a„X" + . . . + a„T" 
^ {aa+aiX + ... + a„X") + X 
- f{X)+I 

which is zero m E. □ 

The extension E is sometimes said to be obtained from F by adjoining a 
root of /. 

Remark. Note that in the above proof, we have shown that a prime ideal in a 
principal ideal domain is maximal. 

Definition 5.3. If E is an extension of F, an element a G is said to be 
algebraic over F if there is a non-constant polynomial / G F[X] such that 
f{a) = 0. If a is not algebraic over F, it is said to be transcendental over F. 
If every element of E is algebraic over F, then E is said to be an algebraic 
extension of F. 



150 



CHAPTER 5. FIELD THEORY 



Suppose that a € E is algebraic over F. Thus there exists by definition a 
polynomial / € F[X] with f{a) = 0. It thus makes sense to consider the set I 
of all polynomials g G F[X] such that g{a) ~ 0. Clearly 

• if .91, (?2 BiTC in I, so does gi± g2, 

• a g el and h e F[X], then gh e I. 

This tells us that I = {g e F[X], g{a) = 0} is an ideal of F[X]. 
Since F[X] is a principal ideal domain, we have 

I={m{X)) 

for some m{X) in Any two generators of I are thus multiple of each 

others, so they must be of same degree, and since m(X) is monic, it has to be 
unique. This polynomial m{X) has the following properties: 

1. If 5 e -P"!-'^]! then g{a) = if and only if m{X) divides g{X). This is clear 
from the definition of T. 

2. m{X) is the monic polynomial of least degree such that m.{a) = 0, which 
follows from the above property. 

3. m{X) is the unique monic irreducible polynomial such that m{a) — 0. 
Indeed, if m{X) = h{X)k{X) with deg/i < degm, degfc < degm, then 
either h{a) = or k{a) ~ 0, so that either h{X) or k{X) is a multiple of 
m{X) by the first property, which is impossible. Thus m(X) is irreducible. 
We are left to prove the unicity of m{X). This comes from the fact that 
since m{X) is monic, then if there were two irreducible monic polynomials 
m{X) and m'{X) such that m{a) = m'{a) = 0, they have a as common 
root, and thus m^X) and m!{X) cannot be distinct (see the proposition 
below) . 

Definition 5.4. The polynomial ra{X) is called the minimal polynomial of a 
over F. It may be denoted by min(Q;, F) or ^a.F- 

Example 5.3. The polynomial X^ + 1 is the minimal polynomial of i over Q. 
It also the minimal polynomial of i over R. 

Proposition 5.2. 1. Let f and g be polynomials over the field F. Then f 
and g are relatively prime if and only if f and g have no common root in 
any extension of F. 

2. If f and g are distinct monic irreducible polynomials over F , then f and 
g have no common roots in any extension of F. 

Proof. 1. If f and g are relatively prime, their greatest common divisor is 1, 
so there are polynomials a{X) and b{X) over F such that 



a{X)f{X)+biX)g{X) = l. 
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If there is a common root say a, then we get that = 1, a contradiction. 

Conversely, let us assume that the greatest common divisor d{X) of f{X) 
and g{X) is non-constant and shovif that then f(X) and g(X) have a 
common root. By the above proposition, there exists E an extension of F 
in which d{X) has a root a. Since d{X) divides both f{X) and g{X), a 
is a common root of / and g in E. 

2. By the first part, it is enough to show that / and g are relatively prime. 
Assume to the contrary that ft, is a non-constant divisor of the polynomials 
/ and g which are irreducible. Then / = f'h and g = g'h with f ,g' non- 
zero constant, and h = jj = y, that is, f ~ yg. It is impossible for / to 
be a constant multiple of g, because / and g are monic and distinct. 

□ 

If E is an extension of F and a G E is a root of a polynomial / G ^'t-''^], 
one may consider the field F{a) generated by F and a, which is the smallest 
subficld of E containing both F and a. Alternatively, F{a) can be described 
as the intersection of all subficlds of E containing F and a, or the set of all 
rational functions 

ao + aia + • • • + ama™ 
bo + bala + . . . + 6„a" 
with ai,bj Cz F, m,n = 0,1, .. . and the denominator is different from 0. 

Theorem 5.3. Let a Cz E be algebraic over F, with minimal polynomial m(X) 
over F of degree n. 

L We have F{a) — F[q\ = _F'„_i[q;] where i^„_i[a] denotes the set of all 
polynomials of degree at most n — 1 with coefficients in F . 

2. {1, a, ... , q;"~^} forms a basis for the vector space F{a) over the field F. 
Consequently [F(a) : F] — n. 

Proof. Let us first prove that i^„_i[Q!] is a field. Let f{X) be any non-zero 
polynomial over F of degree at most n — 1. Since m{X) is irreducible with 
deg / < deg m, f{X) and m{X) are relatively prime, and there exist polynomials 
a{X) and b{X) over F such 

a{X)f{X) + b{X)m{X) = 1. 

Using that a is a root of m, we get 

so that any non-zero element of i^„_i[Q;] has an inverse, and F„_i[q;] is a field. 

1. Any field containing F and a must contain all polynomials in a, and in 
particular all those of degree at most n — 1. Thus 



F„_i[a] C F[a] C F{a). 
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But F{a) is the smallest field containing F and a, so 

F{a) C F„_i[a] 

and we conclude that 

Fia) = F[a] ^ F„-i[a]. 

2. Now 1, a, ... , a"~^ certainly span Fn-i[a], and they are linearly indepen- 
dent because if a non-trivial linear combination of them were zero, this 
would yield a non-zero polynomial of degree less than that of m{X) with 
a as a root, a contradiction. 

□ 

Example 5.4. Let ^5 denote a primitive 5th root of unity (that is, Cf = 1 
C5 7^ 1 for 1 < fc < 4). We have that Cs e QiCb) is algebraic over Q, with 
minimal polynomial + + + X + 1 ^ of degree 4 over Q. A Q-basis 
is given by {1, Cs, C|, ClI and [Q(C5) : Q] = 4. 

Once we have a field extension K/F, we can take again K as base field and 
get another field extension E/K, yielding a tower of extensions E/K/F. 

Proposition 5.4. Consider the field extensions E/K/F . 

1. If at, i G /, form a basis for E over K, and /3j, j ^ J form a basis for K 
over F , then UiPj, i € I , j (z J , form a basis for E over F. 

2. The degree is multiplicative, namely 

[E : F] = [E : K][K : F]. 
In particular, [E : F] is finite if and only if [E : K] and [K : F] are finite. 
Proof 1. Take j e E. Then 

= IIE^^j/^j)"^: b,,€F. 

Thus ai/3j span E over F. We now cheek the linear independence. 

KjO-iPj = ^ ^ Xi^ai = 

i.j i 

for all i and consequently = for all i,j which concludes the proof. 
2. It is enough to use the first part, with 

[E:K] = \I\, [K:F]^\J\, [E : F] ^ \I\\J\. 

□ 
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Example 5.5. Consider the field extension Q(C8)/Q where Cs is a primitive 8th 
root of unity. We have that 

^« = ^+*^ 

and Q{Cs)/Q is the same field extension as Q{i, \/2)/Q. We have 

x/2) : Q] = [Q(z, x/2) : Q(x/2)] [Q(V2) : Q] = 2 • 2 = 4. 

Recall that an algebraic extension is a field extension where every element 
is algebraic. The result below describes families of algebraic extensions. 

Theorem 5.5. If E is a finite extension of F, then E is an algebraic extension 
ofF. 

Proof. Let a E E with degree [E : F] = n. Then 1, a, . . . , a" are n + 1 elements 
while the dimension is n, so they must be linearly dependent, say 

flo + fliOf + . . . + a„Q!" = 0, Oi e F. 

Takep(X) = ao+aiX + . . .+a„X" e a is a root oip{X) and by definition 

a is algebraic over F . □ 

Examples 5.6. 1. By definition, a number field is a finite extension of Q. 
Thus a number field is an algebraic extension of Q. 

2. The converse is not true. There arc infinite algebraic extensions, for ex- 
ample, the field of all algebraic numbers over the rationals is algebraic and 
of infinite degree. 



5.2 Splitting fields and algebraic closures 

For a € E, an extension of F, we have introduced above F(a) as the intersection 
of all the subfields of E containing F and a. This can be of course generalized if 
wc pick tti, . . . ,aj~ & E, and F{ai, . . . , ai^) is the intersection of all the subfields 
of E containing F and ai, . . . , a^. 

Definition 5.5. If E is an extension of F and / £ we say that / splits 

over E ii f can be written as X{X — ai) ■ ■ ■ {X — ak) for some ai, . . . , e E 
and Xe F. 

Definition 5.6. If K is an extension of F and / £ F[X], we say that ii' is a 
splitting field for / over F is f splits over K but not over any proper subfield 
of K containing F. 

Example 5.7. Consider the polynomial f{X) = X^ — 2 over Q. Its roots are 
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Alternatively, if C3 denotes a primitive 3rd root of unity, we can write the roots 
as 

The polynomial / is irreducible (for example using Eiscnstcin's criterion). Since 
it is also monic, it is the minimal polynomial of -v^, and 

[Q(^) :Q] =3. 

Now since \/2 and (or ^3) generate all the roots of /, the splitting field of 
/is 

A' = Q(^,2x/3) =Q(^,C3)- 

We finish by computing the degree of K over Q. Clearly cannot belong to 
Q{\/2) which is a subfield of M, thus [Q(v^,iV3) : Q(^)] is at least 2. Since 
iVSis a root oi +3 e Qiv^)[X], this de gree is exactly 2. By multiplicativity 
of the degrees, we get that 

[K : Q] = 6. 

Using that C3 is a root of + X + 1 stays irreducible over Q(-\/2) gives the 
same result. 

Equivalently, K is a splitting field for / over F if / splits over K and K is 
generated over F by the roots ai, . . . , afe of /, that is K = F{ai, . . . , a^). 

If / € F[X] and / splits over the extension E of F, then E contains a unique 
splitting field for /, namely f (ai, . . . , a^)- 

Here is a result on the degree of splitting fields. Note that the above example 
shows that this bound is tight. 

Proposition 5.6. If f E F[X] and degf = n, then f has a splitting field K 
over F with [K : F] < nl. 

Proof. First we may assume that n > 1, for if n = 0, then / is constant, and 
we take K = F with [K : F] = 1. 

Thus / has at least one root ai, and by Theorem 5.1, there is an extension 
El of F containing ai. Since /(ai) = 0, the minimal polynomial mi(X) of ai 
divides f{X), that is f{X) ~ mi{X)f'{X) for some f'(X), and since deg f = n, 
degmi{X) < n, implying that F{ai)/F has degree at most n. 

We may then further write f{X) = {X — aiY^g{X) where g{ai) 7^ and 
deg(7 < n — 1. If 5 is constant, then f{X) has no other root than ai, and its 
splitting field is F{ai)/F whose degree is at most n which is indeed smaller 
than nl. 

Now if g is non-constant, we can iterate on g the reasoning we did on /. 
Namely, we have that g has degree at least 1, and thus it has at least one root 
a2. Invoking again Theorem 5.1, there is an extension of F{ai) containing a2 
and the extension F{ai, 02) has degree at most n—1 over F{ai) (corresponding 
to the case where ri = 1). Thus we have 

[F{ai,a2):F] = [Fiaua2) : F{ai)][F{ai) : F] 
< [n - l)n. 
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We can now continue inductively to reach that if ai, . . . , a„ are aU the roots of 
/, then 

[F{ai,a2, . . . ,q;„) : F] < n\. 

□ 

If / G F[X] and / sphts over E, then we may take any root a of / and 
adjoin it to F to get the extension F{a). More precisely: 

Theorem 5.7. If a and /3 are roots of the irreducible polynomial f G F[X] in 
an extension E of F, then F{a) is isomorphic to F{P). 

Proof. If / is not monic, start by dividing / by its leading coefficient, so that we 
can assume that / is monic. Since / is monic, irreducible and f{a) = f{/3) = 0, 
/ is the minimal polynomial of a and /3, say of degree n. Now if a € F[a), then 
a can be uniquely written as 

a = oo + aia + . . . + a„_ia"~"'^. 

The map 

oq + aia + . . . + a„_ia"^"^ i-^ oo + ai/3 + . . . + a„-i/3"~"^ 

defines a field isomorphism between F{a) and F{(3). □ 

When discussing field isomorphisms, one may want to emphasize the base 
field. 

Definition 5.7. If E and E' are extensions of F, and t : — ?> i?' is an isomor- 
phism, we say that t is an i^-isomorphism if t fixes F, that is, if 

i{a) = a, a e F. 

Given a polynomial / G F[X], we have discussed its splitting field, namely 
the smallest field over which / splits. If F is Q, R or more generally C, not 
only we can find a splitting field for each polynomial, but we know that there is 
a field C with the property that any polynomial in C[X] splits over C, namely 
C = C itself. 

We now would like to express this property in general, without having to 
assume that F is Q, R or C. Namely, for a general field F, we want an extension 
C of F such that any polynomial in C[X] splits over C. We will later on add 
the requirement that this extension is algebraic. 

Proposition 5.8. If C is afield, the following conditions are equivalent. 
L Every non-constant polynomial f G C[X] has at least one root in C . 

2. Every non-constant polynomial f G C[X] splits over C . 

3. Every irreducible polynomial f G C[X] is linear. 



156 



CHAPTER 5. FIELD THEORY 



4-. C has no proper algebraic extension. 
Proof. Wc prove 1. =^ 2. ^ 3. ^ 4. =^ 1. 

1. 2. Take / e C[X] a non-constant polynomial. Since / has at least one root, 

we write / = {X — ai)g for g sonic polynomial in C[X]. If g is constant, 
we are done since / splits. If g is non-constant, then again by assumption 
it has one root and g ~ {X — a2)h for some h. We conclude by repeating 
inductively. 

2. 3. Take / G C[X] which is irreducible, thus non-constant. By assumption it 

is a product of linear factors. But / is irreducible, so there can be only 
one such factor. 

3. 4. Let E be an algebraic extension of C . Take a €z E with minimal polyno- 

mial / over C. Then / is irreducible and of the form X — a E C[X] by 
assumption. Thus a £ C and E = C . 

4. ^ 1. Let / be a non-constant polynomial in C[X], with root a. We can adjoin 

a to C to obtain C{a). But by assumption, there is no proper algebraic 
extension of C , so C{a) — C and a e C. Thus / has at least one root in 
C and we are done. 

□ 

Definition 5.8. A field C as described in the above equivalent properties is 
said to be algebraically closed. 

Examples 5.8. 1. The field M is not algebraically closed, since X^ + 1 = 
has not root in R. 

2. No finite field F is algebraically closed, since if ai, . . . , a„ are all the ele- 
ments of F. then the polynomial (X — ai) . . . (X — a„) + 1 has no zero in 
F. 

3. The field C is algebraically closed, this is the fundamental theorem of 
algebra. 

4. The field of all algebraic numbers is algebraically closed. (We will not 
prove this here, but for a proof that algebraic numbers in a field extension 
indeed form a field, see Corollary 5.11 below.) 

Wc can embed an arbitrary field F in an algebraically closed field as follows. 

Definition 5.9. An extension C of is called an algebraic closure if C is 
algebraic over F and C is algebraically closed. 

Examples 5.9. To get examples of algebraic closures, we thus need to start 
with known algebraically closed fields. 

1. The field C is the algebraic closure of R. 
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2. The field of all algebraic numbers is the algebraic closure of Q. 

Note that C is minimal among algebraically closed extensions of F. Indeed, 
let us assume that there is an algebraically closed field K such that C/K/F. 
Let a G C but a ^ K {it exists if we assume that C ^ K). Then a is algebraic 
over F^ and consequently algebraic over K. But since a ^ K, the minimal 
polynomial of a over K cannot contain the factor X — a, which contradicts that 
K is an algebraically closed field. 

Wc can prove the following theorems (wc will omit the proof). 

Theorem 5.9. L Every field F has an algebraic closure. 

2. Any two algebraic closures C and C' of F are F -isomorphic. 

3. If E is an algebraic extension of F , C is an algebraic closure of F , and u 
is an embedding of F into C. Then l can be extended to an embedding of 
E into C . 

Let us now prove the first transitivity property of field extensions. Several 
will follow later on in this chapter. 

Proposition 5.10. L If E is generated over F by finitely many elements 
ai, ...,«„ algebraic over F, then E is a finite extension of F. 

2. (Transitivity of algebraic extensions). If E is algebraic over K, and 
K is algebraic over F , then E is algebraic over F . 

Proof. 1. Set E'o = F, Ek ^ F{ai, . . . , ak), 1 < fc < n, in particular En = 
F(ai, . . . ,an) — E hy definition of E. Then Ek = Ek-i{ak), where 
ak is algebraic over F, and hence over Ek-i. Now [E^ : is the 

degree of the minimal polynomial of ak over E^-i, which is finite. By 
multiplicativity of the degrees, wc conclude that 

n 

[E:F]^ \{[Ek:Ek-i]<'x. 

k=l 

2. Let a G E with minimal polynomial 

m{X) ^bo + biX + ... + 6„_iX"-i + X" 

over K since by assumption a is algebraic over K. The coefficients hi 
are in K and thus are algebraic over F. Set L = F{bo, 6i, . . . , 6„_i), by 
the first part, L is a finite extension of F. Therefore m{X) £ L[X], a is 
algebraic over L, and L{a) is a finite extension of L. This gives us the 
following tower of field extensions: 

L(a)/L = F(6o,6i,...,6„_i)/F. 

By transitivity of the degrees, since [L : F] < oo and [L{a) : i] < oo, 
we get that [L{a) : F] < oo. We conclude since we know that all finite 
extensions are algebraic, and thus a is algebraic over F. 

□ 
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Corollary 5.11. If E is an extension of F and A is the set of all elements in 
E that are algebraic over F, then A is a subfield of E. 

Proof, li a, (3 G A, then the sum, difference, product and quotient (if /? 7^ 0) 
of a and f3 belong to F{a,/3), which is a finite extension of F by the first part 
of the above proposition. This is thus an algebraic extension since all finite 
extensions are, and thus a + f3, a — j3, a/3 and a//3 are in A, proving that A is 
a field. □ 



5.3 Separability 

If / is a polynomial in we have seen above that we can construct a splitting 

field K for / over F, and K is such that all roots of / lie in it. We can thus 
study the multiplicity of the roots of / in K. 

Definition 5.10. An irreducible polynomial / £ F[X] is separable if / has no 
repeated roots in a splitting field. It is called inseparable otherwise. Note that if 
/ is not necessarily irreducible, then we call / separable if each of its irreducible 
factors is separable. 

For example f{X) — {X ~ i)'^{X — 2) e Q is separable, since its irreducible 
factors X ~ 1 and X ~ 2 are separable. 

We start by computing a criterion to test if a polynomial has multiple roots. 

Proposition 5.12. Consider 

fix) = ao + aiX + --- + a„X" € F[X] 

and its formal derivative 

fix) = ai+ 2a2X + ■■■+ ?ia„X"-\ 

Then f has a repeated root in a splitting field if and only if the degree of the 
greatest common divisor of f and f is at least 1. 

Proof. Let us assume that / has a repeated root in its splitting field, say a. 
Then we can write 

fiX)^iX-aYhiX) 

where r > 2 since we consider a repeated root. Now we compute the derivative 
of/: 

fix) = riX - aY~^hiX) + iX- aYh'iX) 

and since ?- — 1 > 1, we have that (X — a) is a factor of both / and /'. 

Conversely, let us assume that the greatest common divisor g of / and /' 
has degree at least 1, and let a be a root of g (in a splitting field). By definition 
oi g, X — a \s then a factor of both / and /'. We are left to prove that a is a 
repeated root of /. Indeed, if it were not the case, then fiX) would be of the 
form fix) = iX — a)hiX) where hia) ^ and by computing the derivative, 
we would get (put r = 1 in the above expression for /') /'(a) = /i(a) 7^ which 
contradicts the fact that X — a is a factor of /'. □ 
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As a corollary of this result, we can exhibit two classes of separable polyno- 
mials. 

Corollary 5.13. 1. Over a field of characteristic zero, every polynomial is 
separable. 

2. Over a field F of prime characteristic p, an irreducible polynomial f is 
inseparable if and only if f is the zero polynomial ( equivalently f is in 
F[XP]). 

Proof. 1. Without loss of generality, consider / an irreducible polynomial in 
where F is of characteristic zero. If / is a polynomial of degree n, 
then its derivative /' is of degree less than n. and it cannot possibly be 
the zero polynomial. Since / is irreducible, the greatest common divisor 
of / and /' is either 1 or /, but it cannot be / since /' is of smaller degree. 
Thus it is 1, and / is separable by the above proposition. 

2. We now consider the case where F is of characteristic p. As above, we 
take / an irreducible polynomial of degree n in F[X] and compute its 
derivative /'. If /' is non-zero, we can use the same argument. But /' 
could also be zero, in which case the greatest common divisor of / and /' 
is actually /, and by the above proposition, / has a multiple root and is 
then not separable. That /' = means that / S since we work in 

characteristic p. 

□ 

Example 5.10. Polynomials over M.[X] and Q[X] are separable. 

Another class of separable polynomials are polynomials over finite fields, but 
this asks a little bit more work. 

Lemma 5.14. Let F be a finite field of characteristic p. Consider the map 

f:F^F, f{a) = . 

Then f is an automorphism (called the Frobenius Automorphism). In particular, 
we have for all a ^ F that 

for some (3 £ F . 

Proof. We have that / is a ring automorphism since 
/(I) = 1 

/(a + /3) = {a + PY^aP + pP = f{a) + f{P) 



The second set of equalities uses the binomial expansion modulo p. Now / is a 
monomorphism since is a field, and an injective map from a finite set to itself 
is necessarily surjective. □ 
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Proposition 5.15. Every polynomial is separable over a finite field F (of prime 
characteristic). 

Proof. Suppose that / is an irreducible polynomial which, by contradiction, has 
multiple roots in a splitting field. Using the criterion of the corollary, f{X) 
must be in namely 

f{X) =ao + a^XP + ■■■ + a„X''P, a, G F. 

Using the bijectivity of the Frobenius automorphism, we can write a; ~ b^, 
yielding 

(6o + 6iX + • • • + = bP + b^XP + ■■■ + bP^X^'P = f{X) 

which contradicts the irreducibility of /. □ 

Definition 5.11. If E is an extension of F and a £ E, then a is said to be 
separable over if a is algebraic over F and its minimal polynomial ^J.a,F is a 
separable polynomial. If every element of E is separable over F, we say that E 
is a separable extension of F or that E/F is separable. 

Examples 5.11. 1. Typical examples of separable extensions are finite fields 
and number fields. 

2. If F is a field with algebraic closure C, then C contains a smallest field 
containing all finite separable extensions of F, called the separable closure 
of F. It is a separable extension of F. 

Here is a first result on how separability behaves in a tower of extensions. 

Lemma 5.16. If E/K/F and E is separable over F, then K is separable over 
F and E is separable over K . 

Proof. K/F is separable. Since K is a subfield of E, every element P E K 
belongs to E, and every clement of E is separable over F by assumption. 

E/K is separable. Take a £ E. Since E is separable over F, it is in 
particular algebraic over F and we may consider the minimal polynomial fj,a,F 
of a over F. Denote by fJ-a.K the minimal polynomial of a over K, we have 

fJ'a,K I Ma,F- 

Since fJ.a,F has no repeated root, neither has fJ-a,K, and E/K is separable. □ 

The converse is also true, and gives the transitivity of separable extensions: 
If K/F and E/K are separable, then E/F is separable. 

It is less easy to construct inseparable extensions, but here is a classical 
example. 
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Example 5.12. Let Fp denote the finite field of integers modulo p. Consider 
the field F = ¥p{t) of rational functions in t with coefficients in the finite field 
with p elements Fp. We get a field extension of E/F by adjoining to a root of 
the polynomial — t (one has to check that — t is irreducible over Fp [t] ) . 
The extension E/F is inseparable since 

XP-t = XP- {<ftY = {X- Vt)p, 

which has multiple roots. 

Let E/F be a separable extension of F and let C be an algebraic closure of 
E. We next count the number of embeddings of in C that fix F, that is, the 
number of i^-monomorphisms of E into C. We start with a lemma. 

Lemma 5.17. Let a : E ^ E he an F -monomorphism and assume that f G 
F[X] splits over E. Then a permutes the roots of f , namely, if a is a root of f 
in E then so is a{a). 

Proof. Write f{X) as 

f{x) = bo + hx + --- + bnX", b,eF 

If a is a root of / in E, then 

/(a) ^bo + bia + --- + bna'' = 0. 

Apply a to the above equation, and use that ct is a field homomorphism that 
fixes F to get 

bo + bia{a) H h 6„cr(a)" = 0, 

showing that a-{a) is a root. □ 

Theorem 5.18. Let E/F be a finite separable extension of degree n, and let a 
be an embedding of F into an algebraic closure C. Then a extends to exactly n 
embeddings of E in C. Namely, there are exactly n embeddings t of E into C, 
such that the restriction t\p of t to F coincides with a. In particular, taking a 
to be the identity on F, there are exactly n F -monomorphisms of E into C. 

Proof. Wc do a proof by induction. If n = 1, then E = F and a extends to 
exactly 1 embedding, namely itself. 

We now assume that n > I and choose a E E, a ^ F. Let / = Ha.F be the 
minimal polynomial of a over F of degree say r. It is irreducible and separable 
{E/F is separable by assumption). In order to use the induction hypothesis, 
we need to split the field extension E/F, which we do by considering the field 
extension F(a), which satisfies 

E/F{a)/F, [E : F{a)] = n/r, [F(a) : F] = r. 

We first take care of the extension F{a)/F. Let a be an embedding of F into 
C, and define the polynomial g ~ cr(/), where a is applied on all the coefficients 
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of /. The polynomial g inherits the property of being irreducible and separable 
from /. Let /3 denotes a root of g. We can thus define a unique isomorphism 

F{a) ^ ia{F)){p), bo + b^a + . . . + bra"- ^ a(bo) + (t(6i)/3 + . . . + a{br)(3'- 

and restricted to F it indeed coincides with a. This isomorphism is defined by 
the choice of /?, and there are exactly r choices for it, corresponding to the r 
roots of g (note that this is here that the separability of g is crucial) . For each of 
these r isomorphisms, using the induction hypothesis on [E : F{a)] ~ n/r < n, 
we can extend them to exactly n/r embeddings of E into C. This gives us a 
total oi n/r ■ r distinct embeddings of E into C extending a. We conclude by 
noting that we cannot have more than n such embeddings. □ 

We conclude by giving a nice description of finite separable field extensions. 

Theorem 5.19. (Theorem of the Primitive Element). If E/F is a finite 
separable extension, then 

E = F(7) 

for some 7 G -E. We say that j is a primitive element of E over F. 

Proof. Since we have not studied finite fields yet, let us assume that F is an 
infinite field. (If you have already studied finite fields, then you know we can 
take 7 to be any generator of the cychc group E^). 

We proceed by induction on the degree n of the extension E/F. If n = 1, 
then E = F and we can take any element for a. 

Let us thus assume n > I, the assumption true up to n — 1, and say the 
degree of E/F is n. Choose a E E but not in F. We now look at the field 
extension E/F{a). By induction hypothesis, there is a primitive element (3 
such that 

E = F{a,P). 

We are now going to prove that there exists a c € F such that 

E = F{a + c/3), 

that is 

7 = a + c/3 

will be the primitive clement. We will show that it is enough to take c ^ S", where 
S" is a finite subset of F defined as follows: let / be the minimal polynomial of 
a over F^ and let g be the minimal polynomial of /3 over i^, the exceptional set 
S consists of all c e F such that 

a' — a 

for a' a root of / and /3' a conjugate of /3 (we extend F{a,l3) to a field L in 
which / and g both split to be able to speak of all their roots). 
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To show that 7 is primitive for c ^ S, it is enough to prove that F{a + cfi) 
contains /3 and a = 7 — c/3 (clearly the reverse inclusion holds: F{a + c/3) C 
F{a,j3)). To this end, it is enough to show that the minimal polynomial of /3 
over ^"(7) cannot have degree greater or equal to 2, implying that /3 is in F{'^). 

Note first that if we take the polynomial h{X) defined by 

h{x)^ f{^-cX)eF{^)[x] 

and evaluate it in /3, we get 

h{P) = /(7 - c/3) = f{a + c/3 - c/3) = 0. 

Thus /? is a root of h and the minimal polynomial of /3 over ^"(7) divides both 
g and ft-, so we are done if we show that the greatest common divisor of g and 
h in F{'^)[X] cannot have degree greater or equal to 2. 

Suppose the greatest common divisor does have degree> 2. Then g and h 
have as common root in L not only /3, but also /3' 7^ /3 in L. This is where we 
use the separability of g, since otherwise /? could be a root with multiplicity 2. 
Then 

/(7 - cP') = ^ 7 - c/3' = a' 
for some root a' of /, which can be rewritten as 

a + c/3-c/3' = 0^c-^^ 

which is exactly what was ruled out by choosing c ^ S. □ 

Definition 5.12. A simple extension is a field extension which is generated by 
the adjunction of a single element. 

Thus the primitive clement Theorem above provides a characterization of 
the finite extensions which arc simple. 

Example 5.13. Number fields are simple extensions. 



5.4 Normality 

So far, we have considered two properties of field extensions (both of them being 
transitive): being algebraic and separable. We now introduce a third property, 
which is not transitive, the one of being normal. 

Definition 5.13. An algebraic extension E/F is normal if every irreducible 
polynomial over F that has at least one root in E splits over E. If we call the 
other roots of this polynomial the conjugates of a, we can rephrase the definition 
by saying that if a G E, then all conjugates of a over F are in E. 

Note that this definition assumes that we start with an algebraic extension. 



164 



CHAPTER 5. FIELD THEORY 



Example 5.14. Consider the field extension E = Q(v^)/Q. The roots of the 
irreducible polynomial f{X) = — 2 arc 



where ^3 is a primitive 3rd root of unity (for example (3 = e^'^'/'^). Thus E is 
not a normal extension. 

We can give another characterization in terms of monomorphisms of E. 

Theorem 5.20. The finite extension E/F is normal if and only if every F- 
monomorphism of E into an algebraic closure C is actually an F -automorphism 
of E. (Finite could be replaced by algebraic, which we will not prove). 

Proof. If E/F is normal, then an _F-monomorphism of E into C must map each 
element of E to one of its conjugates (as is the case in the proof of Lemma 5.17). 
Thus t{E) C E, but t{E) is an isomorphic copy of E and thus has the same 
degree as E and E = t{E), showing that r is indeed an i^-automorphism of E. 

Conversely, consider a £ E and let /3 be a conjugate of a over F. There exists 
an i^-monomorphism of E into C that carries a to /3 (the construction is given 
in the proof of Theorem 5.18). If all such embeddings are i^-automorphisms of 
E, that means f3 must be in E, and we conclude that E/F is normal. □ 

Here is another characterization of normal extensions in terms of splitting 



Theorem 5.21. The finite extension E/F is normal if and only if E is a 
splitting field for some polynomial f in F[X]. 

Proof. Let E/F be a finite normal extension of degree n, and let ai, . . . , a„ be 
a basis for E over F. Consider for each its minimal polynomial fi over F. 
By definition of normal extension, since fi has a root in E, then fi splits over 
E, and so does the polynomial 



To prove that E is a splitting field, we are left to prove it is the smallest field 
over which / splits. This is here that we understand why we take such an /. If 
/ were to split over a subfield K, that is K such that 



then each a,; € K, and K ^ E (this is a conclusion we cannot reach if we take 
for / only one fi or a subset of them) . This proves that _B is a splitting field for 
/ over F. 

Conversely, let be a splitting field for some / over F, whose roots are 
denoted by ai, . . . Let r be an _F-monomorphism of E into an algebraic 
closure, that is r takes each into another root of /. 




fields. 



f — fi ■ ■ ■ fn- 



F C K C E 
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Since i? is a splitting field for /, we have 

F{ai, . . . ,a„) = E 
and t{E) C E. Thus since E and t{E) have same dimension, we get that 

t{E) = E 

and T is actually an automorphism of E, and by the above theorem, we conclude 
the E/F is normal. □ 

As a corollary, we see how a subextension inherits the property of normality. 

Corollary 5.22. Let E/K/F be a finite extension ([E : F] < oo). If E/F is 
normal, so is E/K. 

Proof. Since E/F is normal, _E is a splitting field for some polynomial / e ^'[-'f], 
that is E is generated over F by the roots of /. Since / € F[X] C ^'^i-'^], / can 
also be seen as a polynomial in K[X] and E is generated over K by the roots 
of /, and again by the above theorem, E/K is normal. □ 

There is no reason for an arbitrary field extension E/F to be normal. How- 
ever, if E/F is finite (or more generally algebraic) one can always embed it in 
normal extension. 

Definition 5.14. Let E/F be an algebraic extension. The normal closure of 
E/F is an extension field N oi E such that N/E is normal and A'' is minimal 
with this property. 

li E/F is finite, we can see it as follows: E is finitely generated over F, so 
it can be written as E = F{ai, . . . , a„). Let now K be a normal extension of 
F that contains E: 

K/E/F. 

Since K is normal, it must contain not only all the Ui but also all their con- 
jugates. Let fi be the minimal polynomial of a^, i = 1, . . . ,n. Then we can 
rephrase the last statement and say that K must contain all the roots of /i, 
i = 1, . . . , n. Consider the polynomial 

/ = /i • • • /n- 

Then K must contain the splitting field N for / over F. But N /F is normal, 
so N must be the smallest normal extension of F that contains E. Thus TV is a 
normal closure of E over F . 
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The main definitions and results of this ehapter are 

• (3.1). Definitions of: field extension, minimal poly- 
nomial, degree of a field extension, field homomor- 
phism, algebraic, transcendental. That the degree is 
multiplicative. 

• (3.2). Definitions of: to split, splitting field, alge- 
braically closed, algebraic closure. Transitivity of al- 
gebraic extensions. 

• (3.3). Definition of separability, typical separable ex- 
tensions, separability in extension towers, number of 
embeddings into an algebraic closure, primitive ele- 
ment Theorem. 

• (3.4). Definition of normality, two equivalent char- 
acterizations of normal extensions. 



Chapter V>/ 

Exercises for Field Theory 



Exercises marked by (*) are considered difficult. 

6.1 Field extension and minimal polynomial 

Exercise 88. 1. For which of the following p{X) do there exist extensions 
K{a) of K for which a has minimal polynomial p{X)7 

• p{X) = -4, K ^ R. 

• p{X) = X"^ + 1, A' = Z5 (integers modulo 5). 

• p{X) = X'^ + 2, K = Q. 

In the case where you obtain a field extension, what is the degree of the 
extension? 

2. Find an irreducible polynomial of degree 2 over the integers modulo 2. 

Use it to construct a field with 4 elements. Describe the obtained field. 
Answer. 

1. p{X) = - 4 = (X - 2)(X + 2), it is not irreducible so it cannot be 
a minimal polynomial. Then p{X) ~ X^ + 1 = {X ~ 2){X + 2) modulo 
5, so it is not irreducible, and cannot be a minimal polynomial. Finally 
X^ + 2 is irreducible, monic, we obtain the field extension Q(v^)/Q, it is 
of degree 3. 

2. Take the polynomial X"^ + X + 1, it has no root modulo 2 and is thus 
irreducible. We can construct a field using the generic construction that 
we know. The field Z2 [X]/ contains a root a of the polynomial, 
it is a field containing 4 elements. Indeed, it is of degree 2 (degree of the 
minimal polynomial), and a basis is given by {1,0;}, thus every element 
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can be written as a + ba, a,b € Z2. That makes 4 possible elements, and 
the field is described by 

Z2[X]/{X^ +X + l)~{a + ba,a,be Z2}. 

Exercise 89. 1. Show that C/R is an algebraic extension. 

2. Compute the degree ofthe following extensions: Q{\/2)/Q, Q(\/3+\/2)/Q. 

3. Let E = Q(V2) and F = Q(^^/2). Show that -1 is a sum of 2 squares in 
F. Deduce that E and F are not isomorphic. 

Answer. 

1. C/M is an extension of degree 2 (a R-basis is {1, i}), it is thus finite, thus 
algebraic. 

2. [Q(^) : Q] - 3 (a Q-basis is {1, ^, [Q{V^ +_V2) : Q] = 4 (a 
Q-basis is {1,^2, VS, VG}, because Q{V3 + V2) = Q(V3, V2)). 

3. In F, we have that + 1^ = —1. Since both fields have the same 
degree and knowing that a field homomorphism is always injective, we try 
to build a ring homomorphism / from F to E. Thus 

f{{zV2r + I') = /(-I) ^ f{{^V2f) + /(I) = -/(I) 

since / is a ring homomorphism, fm'thermorc, it must send /(I) to 1, thus 
we must have 

that is there must be an element of E whose square is negative which is 
not possible. 

Exercise 90. Consider the extension C/R. What are all the R-automorphisms 
of C? Justify your answer. 

Answer. Write an element x G C a.s x = a + ib, a, b E M., and let a be an 
R-automorphisms. Thus 

(^{x) ~ cr(a) + (7{i)a(b) = a + CT{i)b 

using for the first equality the property of ring homomorphism, and for the 
second one that a fixes R. Thus cf{x) is determined by oii). Since 1^ = —1, we 
have that = it(— 1), that is 

a{if + 1 = 0. 

Thus either a{i) = i or a{i) = — i, which are the only two possible R-automorphims 
of C. 
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Exercise 91. Prove that if [K{u) : K] is odd, then K{u) = K{u'^). 

Answer. We first notice that K{v?) C K{u)^ thus 

[K{u) : K] ^ [K{u) : /^(u^)] [/^(m^) : K]. 

Since u is a root of the polynomial X'^ — u'^ in K{u'^)[X], we have that [K{u) : 
K^u^)] < 2, and it cannot be 2 because [K{u) : K] is odd, thus [K{u) : K^u"^)] — 
1 and the conclusion follows. 

6.2 Splitting fields and algebraic closures 

Exercise 92. What is the splitting field of the following polynomials? 

1. f(x)^{x^-3){x^ + l)eQ{x). 

2. fix) ^x^+x + le ¥2[x]. 
Answer. 

1. We have that /(X) = {x-V3){x+Vi){x-l)(x^+x+ 1), thus the splitting 
field of / must contain \/3 and C3, the primitive third root of unity. This 
then must be Q{i, -y/S). 

2. We have that x"^ + x + 1 is irreducible over F2, we can construct F4 as 
^2[x]/{f{x)), that isF4 ~ F2(w) where + 1 = 0. Thus the splitting 
field of / is F4. 

6.3 Separability 

6.4 Normality 

Exercise 93. Show that Q(^)/Q is not normal. 

Answer. The roots of x^ ~ 5 are s/S, C3 v^, Cf "v^, where (3 denote a primite 
3rd root of unity. Since Q{\^)/Q is totally real, it cannot contain the complex 
roots. 

Exercise 94. Arc the following claims true or false? Justify your answer. 

1. Every polynomial splits over some field. 

2. The polynomial a;"^ + 5 is separable over F7. 

3. Every finite extension is normal. 

4. Every separable extension is normal. 

5. Every finite normal extension is a splitting field for some polynomial. 
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6. A reducible polynomial cannot be separable. 
Answer. 

1. This is true, for every root of the polynomial, there is a field that will 
contain this root, so that we can build a field extension containing all the 
roots (if the polynomial has coefficients in M, then one can use C, but C 
will not work if the polynomial has coefficients in a finite field). 

2. True since F7 is a finite field. 

3. False, Q(v^5)/Q is finite but not normal. 



4. False, Q(v5)/Q is separable (because Q is of characteristic zero) but not 
normal. 

5. True, we proved this. 

6. False, when a polynomial is reducible, the definition of separability applies 
on its irreducible factors, which may or may not be separable. 

Exercise 95. True/False. 

Ql. Every field has non-trivial extensions. 

Q2. Every field has non-trivial algebraic extensions. 

Q3. Extensions of the same degree are isomorphic. 

Q4. Every algebraic extension is finite. 

Q5. Every algebraic extension of Q is finite. 

Q6. Every extension of a finite field is finite. 

Q7. The polynomial + 5 is separable over Zj {— integers modulo 7). 

Q8. Every finite extension is normal. 

Q9. Every separable extension is normal. 
QIO. Every if-monomorphism is a /^-automorphism. 
Qll. Every extension of a field of characteristic is normal. 
Answer. 

Ql. That's true! We are not speaking of algebraic extensions necessarily. Even 
if you take C, you can for example get function fields over C by adding an 
indeterminate. 

Q2. We know that one of the characterizations of algebraically closed fields is 
that they have no non-trivial algebraic extensions! So that is one counter 
example. 
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Q3. False! It's the other way round: if two extensions are isomorphic, then 
they have the same degree. 

Q4. False, it is the other way round! If an extension is finite, it is algebraic. If 
it is algebraic it does not have to be finite (take an algebraic closure). 

Q5. Still false. Taking Q as the base field does not change anything to the 
problem. The same counter example as in the previous question holds: 
you can take an algebraic closure of Q, it is algebraic and infinite. 

Q6. This is still false! You can build a function field as a counter example. 

Q7. It is true. We have proved this result in general for fields of characteristic 
zero and finite fields. 

Q8. It's false! There is no connection between both concepts. For example, 
we know that Q(a) with = 2 is finite and not normal. 

Q9. It is false! There is no connection, you can take as above Q(a) with — 2, 
it is separable and not normal. 

QIO. This is false! For a counter example, take any extension which is not 
normal. You'll find a A'-monomorphism which is not a A'-automorphism. 

Qll. This is wrong! Imagine this were true, then all number fields would be 
normal, this is surely not the case!! 
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Chapter / 

Galois Theory 



Galois theory is named after the French mathematician Evariste Galois. 

Galois was born in 1811, and had what could be called the life of a misun- 
derstood genius. At the age of 15, he was already reading material written for 
professional mathematicians. He took the examination to the "Ecole Polytcch- 
nique" to study mathematics but failed and entered the "Ecole Normale" in 
1828. He wrote his first paper at the age of 18. He tried to advertise his work, 
and sent his discoveries in the theory of polynomial equations to the Academy 
of Sciences, where Cauchy rejected his memoir. He did not get discouraged, and 
in 1830, he wrote again his researches to the Academy of Sciences, where this 
time Fourier got his manuscript. However, Fourier died before reading it. 

A year later, he made a third attempt, and sent to the Academy of Sciences 
a memoir called "On the conditions of solvability of equations by radicals". 
Poisson was a referee, and he answered several months later, declaring the paper 
incomprehensible. 

In 1832, he got involved in a love affair, but got rejected due to a rival, who 
challenged him to a duel. The night before the duel, he wrote a letter to his 
friend, where he reported his mathematical discoveries. He died during the duel 
with pistols in 1832. 

It is after his death that his friend insisted to have his letter published, which 
was finally done by the mathematician Chevalier. 

7.1 Galois group and fixed fields 

Definition 7.1. If E/F is normal and separable, it is said to be a Galois 
extension, or alternatively, we say that E is Galois over F. 

Take E/F a Galois extension of degree n. Since it is separable of degree n, we 
know that there are exactly n i^-monomorphisms of E into an algebraic closure 
C. But E/F being also normal, every i^-automorphism into C is actually and 
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_F-autoniorphism of E. Thus there are exactly n — [E : F] i^-automorphisms of 
E. 

We can define the notion of a Galois group for an arbitrary field extension. 

Definition 7.2. If E/F is a field extension, the Galois group of E/F, denoted 
by Gal{E / F), is the set of F-automorphisms of E. It forms a group under the 
composition of functions. 

Example 7.1. If = Q{\/2), then Gal(£;/Q) = {1}, that is the identity on E. 

The above example illustrates the fact that though one can always define a 
Galois group, we need the extension to be actually Galois to say that the order 
of the Galois group is actually the degree of the field extension. 

Definition 7.3. Let G = Gal{E/F) be the Galois group of the extension E/F. 
If iJ is a subgroup of G, the fixed field of H is the set of elements fixed by every 
automorphism in H, that is 

J'(iJ) ^{xe E, a{x) = X for ah a G H}. 

Vice-versa, if K is an intermediate field, define 

g{K) = Gal{E/K) = {cr e G, <7{x) = x for all x S K}. 

It is the group fixing K. 

Galois theory has much to do with studying the relations between fixed fields 
and fixing groups. 

Proposition 7.1. Let E/F he a finite Galois extension with Galois group G = 
G'a\{E/F). Then 

L The fixed field of G is F. 



7.1. GALOIS GROUP AND FIXED FIELDS 



175 



2. If H is a proper subgroup of G, then the fixed field T{H) of H properly 
contains F. 

Proof. 1. Let Fq be the fixed field of G (and wc have the field extensions 
E/Fq/F). We want to prove that Fq = F. 

We first note that if a is an F-automorphism of E (that is cr is in G), 
then by definition of i^Oj fixes everything in Fq, meaning that a is an 
-Fo-automorphism. Thus the F-automorphisms in the group G coincide 
with the Fo"£^utomorphisms in the group G. 

Now we further have that E/Fq is Galois: indeed, we have E/Fq/F with 
E/F Galois thus normal and separable, and E/Fq inherits both properties. 

We now look at the degrees of the extensions considered: 

|Gal(F/Fo)| = [E : Fo], |Gal(£;/F)| = [E : F], 

since both arc Galois. Furthermore by the first remark, the number of F— 
and Fq— automorphisms in G coincide: 

|Gal(F/Fo)| = \Ga\{E/F)\ 

showing that 

[E : Fo] = [F : F] 
and by multiplicativity of the degrees 

[F : F] = [F : Fo][Fo : F] ^ [Fq : F] = 1 

and F ^ Fq. 

2. In order to prove that F C J-{H), let us assume by contradiction that 
F = T{H). 

Since we consider a finite Galois extension, we can invoke the Theorem of 
the Primitive Element and claim that 

F = F(a), a e F. (7.1) 

Consider the polynomial 

fiX)= ll{X-CTia))eE[X]. 

It is a priori in F[X], but we will prove now that it is actually in F[X]. 
Since by contradiction we are assuming that F = J-'{H), it is enough to 
proof that f{X) is fixed by H. Indeed, take t E H, then 
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since rcr ranges over all H as does a. 

Thus ,f{X) £ F[X] and /(a) = (cr must be the identity once while 
ranging through H). Now on the one hand, we have 

degf^\H\<\G\^[E:F] 

since we assume that H is proper and E/F is Galois. On the other hand, 

dcgf>[F{a):F] = [E:F] 

since / is a multiple of the minimal polynomial of a over F (equality holds 
if / is the minimal polynomial of a over F), and E = F{a) by (7.1). We 
cannot possibly have deg/ < [E : F] and deg/ > [E : F] at the same 
time, which is a contradiction and concludes the proof. 

□ 



7.2 The fundamental Theorem of Galois theory 

The most significant discovery of Galois is that (surely not in these terms!) 
under some hypotheses, there is a one-to-one correspondence between 

1. subgroups of the Galois group Gal{E/F) 

2. subfields M of E such that F C M. 
The correspondence goes as follows: 

• To each intermediate subfield M, associate the group Gal(£'/A/) of all 
M-automorphisms of E: 

Q = Gal : {intermediate fields} — > {subgroups of Gal(i?/F)} 

M ^ g{M) = G&\{E/M). 

• To each subgroup H of Gal(£'/i^), associate the fixed subfield T{H): 

F : {subgroups of Ga\{E / F)} — > {intermediate fields} 

H ^ T{H). 

We will prove below that, under the right hypotheses, we actually have a 
bijcction (namely G is the inverse of J^). Let us start with an example. 



Example 7.2. Consider the field extension E = Q(i,v5)/Q. It has four Q 
automorphisms, given by (it is enough to describe their actions on i and \^): 



(Ti : 


i i, 


x/5 




x/5 


0-2 : 


i 1— — i, 


V5 






c^a : 


i I— z, 


V5 






f74 : 


i 1— —i, 


V5 




-V5 
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thus 

Gal{E/Q) = {(71,^2,^3,^4}. 
The proper subgroups of Gal(i?/Q) are 

{cTi}, {(Ti, 0-2}, {cti, 0-3}, {cti, (74} 

and their corresponding subfields are 

E, Q{V5), Q{i), QiiV5). 

We thus get the foUowing diagram: 

E 




2 

2 




Theorem 7.2. iet E'/F be a finite Galois extension with Galois group G. 

L The map T is a bijection from subgroups to intermediate fields, with in- 
verse Q. 

2. Consider the intermediate field K = T{H) which is fixed by H , and a £ G. 
Then the intermediate field 

aK = {cr(x), X e K} 

is fixed by aHa~^ , namely aK ~ T{aHa^^). 

Proof. 1. We first consider the composition of maps 

H ^ F{H)^gF{H). 

We need to prove that QF{H) = H. Take a in H, then a fixes J-{H) by 
definition and a G Ga\{E/F{H)) = g{F{H)), showing that 

H C gF{H). 

To prove equality, we need to rule out the strict inclusion. If H were 
a proper subgroup of Q{!F{H))^ by the above proposition the fixed field 
J^{H) of H should properly contain the fixed field of QF{H) which is 
J^{H) itself, a contradiction, showing that 

H = gF{H). 
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Now consider the reverse composition of maps 

This time we need to prove that K = FQ{K). But 

Fg{K) = fixed field by G&\{E/K) 
which is exactly K by the above proposition (its first point). 

2. It is enough to compute T{aHa^^) and show that it is actually equal to 
aK = aT{H). 

F{GHa^^) = {x e E, aT(T^^{x) = x for all t e H} 

= {x e E, Ta-\x) ^ a~\x) al\T e H} 

= {x<eE, a-\x) e T{H)} 

= {xeE, xe (t{T{H))} = a{T{H)). 

□ 

We now look at subextensions of the finite Galois extension E/F and ask 
about their respective Galois group. 

Theorem 7.3. Let E/F be a finite Galois extension with Galois group G. Let 
K he an intermediate suhfield, fixed by the subgroup H . 

L The extension E/K is Galois. 

2. The extension K/F is normal if and only if H is a normal subgroup of G. 

3. If H is a normal subgroup of G, then 

Gal(A7F) ~ G/H = Ga\{E / F) / G&\{E / K) . 

4-. Whether K/F is normal or not, we have 

[K:F] = [G:H]. 

Proof. 1. That E/K is Galois is immediate from the fact that a subextension 
E/K/F inherits normality and separability from E/F. 

2. First note that a is an F-monomorphism of K into E if and only if a is 
the restriction to K of an element of G: if a is an _F-monomorphism of K 
into E, it can be extended to an A-monomorphism of E into itself thanks 
to the normality of E. Conversely, if t is an A-automorphism of E, then 
a = t\x is surely a A-monomorphism of K into E. 

Now, this time by a characterization of a normal extension, we have 
K/F normal ^==^ o-(A') = K for all a e G. 



7.3. FINITE FIELDS 179 



Since K = T{H), we just rewrite 

K/F normal cf{F{H)) = F{H) for all a e G. 

Now by the above theorem, we know that a{F{H)) = F{aHa^^), and we 
have 

K/F normal TiaHa^^) = F{H) for ah cr e G. 

We are almost there, we now use again the above theorem that tells us 
that T is invertible, with inverse t/, to get the conclusion: 

K/F normal •^=^ aHa^^ = H for aU a e G. 

3. To prove this isomorphism, we will use the 1st isomorphism Theorem for 
groups. Consider the group homomorphism 

Gal(£;/i^) ^ Ga\{K/F), a ^ (j\k. 

This map is surjective (we showed it above, when we mentioned that we 
can extend a\K to a. Its kernel is given by 

Kcr = {a, (j\k = 1} = H = Gsil{E/K). 

Applying the 1st isomorphism Theorem for groups, we get 

Gal(A7i^) ~ Gal(£;/F)/Gal(£;/i^). 

4. Finally, by multiplicativity of the degrees: 

[E : F]^[E: K][K : F]. 
Since E/F and E/K are Galois, we can rewrite 

\G\ = \H\[K:F]. 
We conclude by Lagrange Theorem: 

[G:H] = \G\/\H\ = [K:F]. 

□ 

7.3 Finite fields 

We will provide a precise classification of finite fields. 
Theorem 7.4. Let E be a finite field of characteristic p. 
1. The cardinality of E is 

for some n > 1. It is denoted E = Fpn . 
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2. Furthermore, E is the splitting field for the separable polynomial 

f{X) ^XP" -X 

over ¥p, so that any finite field with p" elements is isomorphic to E. In 
fact, E coincides with the set of roots of f. 

Proof. 1. Let Fp be the finite field with p elements, given by the integers 
modulo p. Since E has characteristic p, it contains a copy of Fp. Thus E 
is a field extension of Fp, and we may see £' as a vector space over Fp. If 
the dimension is n, then let ai, . . . , q;„ be a basis. Every x in E can be 
written as 

X = Xiai + • • • + XnCtn 

and there are p choices for each Xi, thus a total of p" different elements 
in E. 

2. Let E^ be the multiplicative group of non-zero elements of E. If a G E^ , 
then 

aP"-^ = 1 
by Lagrange's Theorem, so that 

aP = a 

for all a in E (including a = 0). Thus each element of i? is a root of /, 
and / is separable. 

Now / has at most p" distinct roots, and we have already identified the 
p" elements of E as roots of /. 

□ 

Corollary 7.5. If E is a finite field of characteristic p, then E/¥p is a Galois 
extension, with cyclic Galois group, generated by the Frobenius automorphism 

cr : .T I— > (j{x) = xP , X Cz E. 

Proof. By the above proposition, we know that E is a splitting field for a sepa- 
rable polynomial over Fp, thus E/¥p is Galois. 
Since xP = x for all x in Fp, we have that 

Fp C 

that is Fp is contained in the fixed field of the cyclic subgroup generated by the 
Frobenius automorphism a. But conversely, each element fixed by cr is a root 
of XP — X so has at most p elements. Consequently 

Fp = ^((a)) 

and 

Gal(S/Fp) = (a). 

□ 
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This can be generalized when the base field is larger than ¥p. 

Corollary 7.6. LetE/F be a finite field extension with \E\ — p" and \F\ = . 
Then E/F is a Galois extension and •m\n. Furthermore, the Galois group is 
cyclic, generated by the automorphism 

T : X ^ t{x) = x^ , X € E. 

Proof. If the degree [E : F] ~ d, then every x in E can be written as 

X = xiai H h Xciad 

and there are p™ choices for each Xi, thus a total of 

different elements in E, so that 

d = m/n and m\n. 

The same proof as for the above corollary holds for the rest. □ 

Thus a way to construct a finite field E is. given p and n, to construct 
E = Fpii as a splitting field for — X over Fp. 

Theorem 7.7. If G is a finite subgroup of the multiplicative group of an arbi- 
trary field, then G is cyclic. Thus in particular, the multiplicative group E^ of 
a finite field E is cyclic. 

Proof. The proof relies on the following fact: if G is a finite abelian group, it 
contains an clement g whose order r is the exponent of G, that is, the least 
common multiple of the orders of all elements of G. 

Assuming this fact, we proceed as follows: if a; G G, then its order divides r 
and thus 

x'' ^ 1. 

Therefore each element of G is a root of X^ — 1 and 

\G\<r. 

Conversely, |G| is a multiple of the order of every clement, so |G| is at least as 
big as their least common multiple, that is 

\G\>r 

and 

\G\^r. 

Since the order of |G| is r, and it coincides with the order of the element g 
whose order is the exponent, we have that G is generated by g, that is G = (g) 
is cycfic. □ 
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Since E^ is cyclic, it is generated by a single element, say a: 

E = Fp(a) 

and a is called a primitive element of E. The minimal polynomial of a is called 
a primitive polynomial. 

Example 7.3. Consider the following irreducible polynomial 

g{X) = X^ + X + l 

over F2. Let a be a root of g{X). A direct computation shows that a is 
primitive: 

a" = 1, . . . , a'' = a + 1, . . . , = + a + 1, . . . , a^'' = 1 + a^. 

7.4 Cyclotomic fields 

Definition 7.4. A cyclotomic extension of a field is a splitting field E for 
the polynomial 

f{X) = X" - 1 
over F. The roots of / are called nth roots of unity. 

The nth roots of unity form a multiplicative subgroup of the group i?^ of 
non-zero elements of E, and thus must be cyclic. A primitive nth root of unity 
is an nth root of unity whose order m. E^ is n. It is denoted Cn- 

From now on, we will assume that we work in a characteristic char(F) such 
that char(F) does not divide n. (Otherwise, we have n = 7nchar(i^) and = 

Q-l = (C™ - l)char(F) ^^^^ ^^Y^ ^^^^^ -g jggg ^j^^^^^ „ ) 

Example 7.4. The field Q(Cp) where p is a prime and Cp is a primitive pth root 
of unity is a cyclotomic field over Q. 

Let us look at the Galois group Ga\{E/F) of the cyclotomic extension E/F. 
Then a E Gsl\{E/F) must map a primitive nth root of unity C„ to another 
primitive nth root of unity with (r, n) = 1. We can then identify a with r, 
and this shows that 

Gal(£;/F) ~ Un 

where [/„ denotes the group of units modulo n. This shows that the Galois 
group is abelian. 

Example 7.5. Consider the field extension Q(C3)/Q- We have 

X^ -1 = {X - 1){X^ + X + 1). 
The Galois group is given by: 



: Cs ^ C3 
o-^ : Cs ^ Cs 
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and the group U3 of units modulo 3 is t/3 = {1, 2}. Thus 

Gal(Q(C3)/Q) = {a, 1} ^ {2, 1} = (Z/3Z) ^ . 

Finally, since E/F is Galois (under the above assumption) 

[E:F] = \Ge.l{E/F)\^ipin) 

where ip{n) is the Euler toticnt function. 

From now on, we fix the base field F = Q. This means that a primitive nth 
root of unity ^„ is given by 

Definition 7.5. The nth cyclotomic polynomial is defined by 

{i,n) = l 

where the product is taken over all primitive nth roots of unity in C. 

The degree of ^'„(X) is thus 

deg(*„) = ip{n). 

Example 7.6. For n = 1, 2, we have 

= X -1, *2(^) = x - (-1) + 1. 

Computing a cyclotomic polynomial is not that easy. Here is a formula that 
can help. 

Proposition 7.8. We have 

X"-l = []*d(X). 

d\7i 

In particular, if n ~ p a prime, then d is either 1 or p and 
- 1 = = (X - l)*p(X) 

from which we get 

YP — 1 

'J-p(X) = — = + xp-^ + --- + X + 1. 

X — 1 

Proof. We prove equality by comparing the roots of both monic polynomials. 
If C is a nth root of unity, then by definition 

C = o 

and its order d divides n. Thus C is actually a primitive dth root of unity, and 
a root of ^d{X). 

Conversely, if d\n, then any root of 'i>d{X) is a dth root hence a nth root of 
unity. □ 
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Examples 7.7. For ti = 3 and 5, we have a prime and thus we can use the 
above formula: 

*3(X) = X'^+X + 1 

^'5(X) = +X + 1. 

For n = 4 the primitive 4rth roots of unity are ±i, and by definition 

*4(^) = {X- i){X + i)=X^ + l. 

Finally for n = 6, the possible values for d are 1,2,3 and 6. Thus 

"^'^^^ = {x-i){x + i)ix-^ + x + i) = - ^ + 1- 

From the above examples, it is tempting to say that in general has 
integer coefficients. It happens to be true. 

Proposition 7.9. The nth cyclotomic polynomial satisfies 

*„(x)eZ[X]. 

Proof. We proceed by induction on n. It is true for rt = 1 since X — 1 E Z[X]. 
Let us suppose it is true for '^^{X) where fc is up to n — 1, and prove it is also 
true for n. 

Using the above proposition, we know that 
X"-l = H'fdiX) 

d\n 

= vi/„(x) n ^4x). 

d\n,d<n 

The aim is to prove that ^n(^) ^ I^[X]: 

vn 1 

l[d\n.d<n'^d[X) 

First note that has to be monic (by definition), and both X" — 1 and 

'i>d{X) (by induction hypothesis) are in Z[X]. We can thus conclude invoking 
the division algorithm for polynomials in Z[X]. □ 

We conclude by proving the irreducibility of the cyclotomic polynomials. 

Theorem 7.10. The cyclotomic polynomial ^'„(X) is irreducible over Q. 

Proof. Let f{X) be the minimal polynomial of C„, a primitive nth root of imity 
over Q(X). We first note that by definition f(X) is monic, and thus since 
fiX)\X" - 1, we have 

X" - 1 = f{X)g{X) (7.2) 
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and f{X) and g{X) must be in Z[X]. 

To prove that ^'^(a;) is irreducible, we will actually prove that 

= f{X). 

To prove the equality, it is enough to show that every root of ^'„(X) is a root 
of/W. 

We need the following intermediate result: if p does not divide n, then 

fiCn) - 0. 

Let us prove this result. Suppose by contradiction that this is not the case, 
namely /(C^) 7^ 0. By (7.2), wc have 

X" - 1 = fiX)giX), 

which evaluated in X = Cn yields 

(CD" -1 = = fiOgiO 

implying by our assumption that f{Cn) that 

giO = 0, 

or in other words, is a root of g{XP) . But by definition of minimal polynomial, 
we have that f{X) must then divide g{XP), that is 

g{XP) = fix)h{x), h{x)enx]. 

Since g{X'''), f{X) and h{X) are in we can look at their reduction modulo 

p, that is work in Fp[X]. We will denote p{X) the polynomial obtained from 
p{X) by taking all its coefficients modulo p: \ip{X) = X]"=o "^i^N then p{X) = 
J27=o(^i modp)X*. Therefore 

g{XP)^f{X)h{X)e¥,[X]. 
By working in Fp[X], we are now allowed to write that 

-g{XP) = -giXr 

and thus 

giXr ^ f{X)h{X)e¥p[X]. 

This tells us that any irreducible factor of f{X) divides g{X) and consequently 
/ and g have a common factor. Looking at (7.2) in Fp[X] gives 

x"-i = f{x)h{x)e¥p[x]. 

Since / and g have a common factor, X" — 1 has a multiple root, which cannot 
be since we have assumed that p does not divide n. This proves the claim. 
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To summarize, we have just proven that if p does not divide n, then f{Cn) is 
another root of /. Since aU primitive nth roots of unity can be obtained from 
Cn by successive prime powers, we have that all primitive nth roots of unity are 
actually roots of f(X), and we know that there are (p{n) of them, which is also 
the degree of \E'„(X). This concludes the proof, since 

degfiX) > ^{n) = deg(*„(X)) ^ /(X) = ^^(X). 

□ 



7.5 Solvability by radicals 

The question of solvability by radicals is the one of solving polynomial equations 
under the restriction that we are only allowed to perform addition, subtraction, 
multiplication, division, and taking nth roots. 

For example, we know (Fontana-Tartaglia. 1535) that for a cubic equation 

X^+pX = q, 

the solution is given by 




By the 16th century all polynomial equations of degree smaller or equal to 4 were 
solved. The natural question was then: what happens with quintic equations? 
Euler failed to give an answer, Lagrange (1770) proved that it depends on finding 
functions of the roots which are unchanged by certain permutations of the roots, 
and that this approach works up to degree 4 and fails for 5. Abel showed (1824) 
that quintics arc insolvable by radicals. The next question thus became: decide 
whether or not a given equation can be solved by radicals. Liouvillc (1843) 
found the answer in Galois's papers. 

The answer is to be found by connecting the problem with field theory as 
follows. We first need to define the notion of a radical extension. Informally, a 
radical extension is obtained by adjoining a sequence of nth roots. For example, 
to get a radical extension of Q containing 

we must adjoin 



This can be stated formally: 
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Definition 7.6. An extension E/F is radical if E = F{ai, . . . where for 
all i = 1, . . . , n, there exists an integer n(i) such that 

a"^'^ ei^(ai,...,«,_i), i>2. 
The tti's are said to form a radical sequence for E/F. 
Example 7.8. The expression 

is contained in Q(q;, /3, 7, (5, e), where 

Definition 7.7. Let / be a polynomial over a field F of characteristic zero (this 
is a simplifying assumption). We say that / is solvable (soluble) by radicals if 
there exists a field E containing a splitting field for / such that E/F is a radical 
extension. 

We want to connect radical extensions and solvable groups. Here is the main 
theorem: 

Theorem 7.11. If F is a field of characteristic zero, and F C E Q M where 
M/F is a radical extension, then the Galois group of E / F is a solvable group. 

Thus a solvable (by radicals) polynomial has a solvable Galois group (of a 
splitting field over the base field). 

Recall that a group G is solvable if G has a normal series 

{1} = Gr < Gr-l < . . . < Go = G 

with Gi/Gi+i abelian. The proof takes some fair amount of work, though the 
idea is simple. A radical extension is a series of extensions by nth roots. Such 
extensions have abelian Galois groups (to be proven though...), so the Galois 
group of a radical extension is made up by fitting together a sequence of abelian 
groups (unfortunately, the proof is not that simple...) 

We can restate the above result in terms of polynomials. 

Theorem 7.12. Let f be a polynomial over a field E of characteristic zero. If 
f is solvable by radicals then its Galois group (that is the Galois group of its 
splitting field) over E is a solvable group. 

To find a polynomial which is not solvable by radicals, it suffices to find one 
whose Galois group is not solvable. 

Lemma 7.13. Let p be a prime, f an irreducible polynomial of degree p over 
Q. Suppose that f has precisely two non-real zeros in C. Then the Galois group 
of f over Q is the symmetric group Sp . 
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Theorem 7.14. The polynomial — 6X + 3 over Q is not solvable by radicals. 

The proof consists of showing that the polynomial is irreducible over Q, by 
Eisenstein's criterion. Then / has exactly three real zeros with multiplicity 1 
each, and the above lemma says that is Galois group is 55. To conclude, we 
need to show that the symmetric group Sn is not solvable if n > 5. 

7.6 Solvability by ruler and compasses 

The ancient Greek philosopher Plato believed that the only perfect figures were 
the straight line and the circle, and this belief had a great impact in ancient 
Greek geometry: it restricted the instruments available for performing geomet- 
rical constructions to ruler and compasses. 

Many constructions can be done just be using ruler and compasses, but three 
famous constructions could not be performed: 

• duplication of the cube: find a cube twice the volume of a given cube. 

• trisection of the angle: find an angle 1/3 the size of a given angle. 

• quadrature of the circle: find a square of area equal to those of a given 
circle. 

It is no wonder those problems remained unsolved (again, under these pla- 
tonic constraints) since we will see, using our modern tools, that none of them 
are possible. 

We start by formalizing the intuitive idea of a ruler and compass construc- 
tion. Denote by Pq the set of points in M^. 

• operation 1 (ruler): through any 2 points of Pq, draw a straight line. 

• operation 2 (compasses): draw a circle, whose center is a point of Pq and 
whose radius is equal to the distance between some pairs of points in Pq. 

Definition 7.8. The points of intersection of any two distinct lines or circles, 
drawn using operations 1 and 2 are said to be constructible from Pq if there 
exists a sequence ri, . . . , r,i of points of such that for each i ~ 1, . . . , 71 the 
point ri is constructible from the set Pq U {ri, . . . , r^-i}. Pi = Pi-i U {r,}. 

We can now bring field theory into play. With each stage, we associate the 
subfield of R generated by the coordinates of the points constructed. Denote by 
Kq the subfield of M generated by the x- and y-coordinates of the points in Pq. 
If ri has coordinates {xi,yi), then inductively we define 

= Ki^i{xi,yi) 

to get 

Kq C Ki C . . . C C R. 
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Lemma 7.15. With the above notation, Xi and yi are zeros in Ki of quadratic 
polynomials over Ki-i. 

Proof. There are 3 cases to consider: line meets line, line meets circle and circle 
meets circle. We only give the proof of line meets circle. 

Take 3 points A = {p,q), B = {r,s), C ~ (t,u) in ifi-i, then draw a line 
between A and B, and a circle of center C with radius w. The equation of the 
line AB is 

x-p ^ y-q 

r — p s — q 
while the equation of the circle is 

{x-tf + {y-uf = w\ 

Solving them yields 

{x-t)'^+(- -{;x-p)+q-'u\ 

\r~p J 

Now cc, the first coordinate of the intersection point, is a zero of a quadratic 
polynomial over Ki_i. □ 

We note that fields obtained by adjoining the zeroes of a quadratic polyno- 
mial arc extensions of degree 2. 

Theorem 7.16. If r = (x, y) is constructible from a subset Pq G M^, and if Kq 
is the subfield of M generated by the coordinates of the points of Pq, then the 
degrees [Kq (x) : Kq] and [Kq (y) : Kq] are powers of 2. 

Proof. We have that 

[K,.i(xi) : lU^i] = 1 or 2, : K,^i\ = 1 or 2. 

Using multiplication of degrees, we get 

[K,_i{xi,y{) : Ki^i] = [Ki^i{xi,yi) : Ki_i{xi)][Ki_i{xi) : Ki^i] = 1 or 2 or 4 

with Ki = Ki^i(xi,yi). Thus [Kn ■ Kq] is a power of 2 implying that [Kn '■ 
Kq{x)][Kq{x) : Kq] is a power of 2 from which we conclude that [Kq{x) : Kq] is 
a power of 2, and similarly for y. □ 

We are now ready to discuss the impossibility proofs. 

Theorem 7.17. The cube cannot be duplicated using ruler and compass con- 
structions. 

Proof. Take a cube whose side is the unit interval, that is of volume 1. We have 
Pq = {(0, 0), (1, 0)} and Kq = Q. If we could duplicate the cube, then we can 
construct a point (q;,0) such that the volume is equal to 2, that is 
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Now [Q{a) : Q] is a power of 2, but a is a zero of — 2 which is irreducible (by 
Eisenstein) over Q. This gives that 

Ma) : Q] = 3, 

a contradiction to the fact that it should be a power of 2. □ 

Theorem 7.18. The angle 7r/3 cannot be trisected using ruler and compass 
constructions. 

Proof. Constructing an angle trisecting 7r/3 is equal to constructing the point 
(a, 0) given (0,0) and (1,0) where a = cos(7r/9). Knowing a — cos(7r/9), we 
can construct 

/3 2cos(7r/9). 

Using that cos(30) Acos^{e) - 3cos(6') and cos(30) = 1/2 when 9 = tt/9, we 
have 

1 = 8cos3(6i) - 6cos(6i) =^^3-3/3-1 = 0. 
Now f{t) = t"^ — 3t — 1 is irreducible over Q (apply Eisenstein on /(t + 1)) thus 

[Q(/3) : Q] = 3 

contradicting the fact that it should be a power of 2. □ 

Theorem 7.19. The circle cannot be squared using ruler and compass con- 
structions. 

Proof. Without loss of generality, we assume that the circle is the unit circle 
centered at (0, 0). Constructing a square with area tt is equivalent to construct- 
ing a point (^/tt, 0). Since the smallest field with and 1 is Q, the field obtained 
from adjoining {y/TT,0) is Q(v^)- Thus [Q{\/tt) : Q] should be a power of 2, 
and in particular it should be algebraic, which is a contradiction (Lindeman's 
Theorem shows the transcendence of tt, 1882). □ 



The main definitions and results of this chapter are 

• (4.1). Definitions of: Galois extension, Galois group, 
fixed field. 

• (4.2). The fundamental theorem of Galois theory, 
Galois groups of intermediate fields. 

• (4.3). Characterization of finite fields, their Galois 
group, their multiplicative group. 

• (4.4). Definition of cyclotomic field, primitive root 
of unity, cyclotomic polynomial. The Galois group of 
a cyclotomic field. 



Chapter v-l 

Exercises on Galois Theory 



Exercises marked by (*) are considered difficult. 

8.1 Galois group and fixed fields 

Exercise 96. Compute tire Galois group of X'^ — 2 over Q and F3, the finite 
field with 3 elements. 

Answer. Over Q, we have 

X*-2^{X^- V2){X'^ + V2)^{X- 2^/^){X + 2^/^){X - i2^/^){X + 12^1^), 

while over F3, let be a root of the irreducible polynomial + X + 2 = 0, 
then 

= —w + 1, w'^ — —1, = 1 

and 

X^ ~2^ X^ + 1 = {X^ - w^){X^ + uP) = iX- w){X + w){X - w^)iX + w^). 

8.2 The fundamental Theorem of Galois theory 

Exercise 97. 1. Compute the splitting field K of the polynomial f{x) ~ 
a;4 _ 2 e Qix). 

2. Show that K is a, Galois extension. 

3. Compute the degree of K/Q. 

4. Compute the Q-automorphisms of K. 

5. Do you recognize Gal(A'/Q)? 
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6. What are all the subgroups of Gal{K/Q)l 

7. What are all the intermediate subfields of K/Ql 

8. Among the intermediate subfields, which arc normal? 
Answer. 

1. We have that /(a;) = {x'^ - y/2){x'^ + ^2) = {x- ^){x+ ^){x + i^){x^ 
iy^). Thus the splitting field of / is </2). 

2. It is a splitting field thus K is normal, it is separable because Q is of 
characteristic zero. 

3. The degree is 

mV2,i) : Q] = [Q{</2,i) : Q(^)][Q(^) : Q]. 

The minimum polynomial of i over Q{\/2) is x^+l, so [Q{\^, i) : <^{\/2)] = 
2. Since f{x) is irreducible over Q (by Eisenstein). it is the minimal poly- 
nomial of over Q, thus [Q(v^) : Q] 4 and finally the total degree is 
8. 

4. There are 8 of them. We have 
and 

T(^) = -^, r(^) = ^ 
and we can find the others by combining these two, namely: 



1 : 






i i-^ i 


a : 




i</2 


i i-^ i 


a': 




-</2 




a' : 






i i-^ i 


T : 








CJT : 




i H- — 








i I— > — 








2 I— > — 



5. This is the dihedral group of order 8. 

6. • order 8: G, order 1: {1}. 

• order 4: there arc 3 of them 

S = {1,C^,C^^C^^} ~ C4, T = {1, (T^, T, crV} ~ C2XC2, U = {1, cr^, (TT, cr'V} ~ C2XC2. 
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• order 2, there are 5 of them, all isomorphic to C2: 

A = {l,a'^}, B = {l,r}, C = {l,crr}, D^{1,<7^t}, E = {1,c7^t}. 

7. By Galois correspondence, we obtain the intermediate fiels as fixed fields 
of the subgroups. The subfields of degree 2 are the easiest to find: 

Q(i), qiV2}, QiiV2) 

which are fixed by resp. S, T and U. By direct computation (that is, 
apply the automorphism on an element of the larger field, and solve the 
equation that describes that this element is fixed by this automorphism), 
we find that the others are: 

fixed resp. by C, A and B. 

8. The normal subgroups of G are G, S,T,U, A, I, thus their corresponding 
fixed fields are normal extensions of Q. 

Exercise 98. Let K be the subficld of C generated over Q by i and a/2. 

1. Show that [K : Q] = 4. 

2. Give a primitive element of K and its minimal polynomial. 

3. Show that Gal(A7Q) ~ (Z/2Z)2. 

4. Give a fist of all the subfields of K. 
Answer. 

1. Since K = Q{i,V^), we can first build Q{V2)/Q which is of degree 2, 
because — 2 is irreducible, then we check that x'^ + 1 is irreducible over 
Q{\/2), so we obtain another extension of degree 2, by multiplicativity of 
the degrees, this gives an extension of degree 4. 

2. For example, Cs? the primitive 8th root of unity, is a primitive element, 
with minimal polynomial + 1. 

3. The Galois group is given by {1, cr, t, err} where 

a : i ^ —i, -^2 n- ^/2, t : i, \/2 — \/2. 

4. There is one for each subgroup of the Galois group. Since there are only 
subgroups of order 2 (but for the whole group and the trivial subgroup), 
we get 3 quadratic field extensions: 



Q(z), Q(\/2), Q(iA/2). 
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Exercise 99. 1. Show that X'^ — 3 = is irreducible over Q. 

2. Compute the sphtting field E ol X"^ — 3 = 0. 

3. Compute the Galois group of E/Q. 

4. Can you recognize this group? 

5. Choose two proper, non-trivial subgroups of the Galois group above, and 
compute their corresponding fixed subfields. 

Answer. 

1. Use Eisenstein with p ~ 3. 



2. The roots of X"^ — 3 arc j — 0,1,2,3, thus the splitting field is 



4. It is the dihedral group. 

5. Again as in previous exercise. 

Exercise 100. Consider the field extensions M = Q(V2,a/3) aud E = M{a) 



1. Show that M is a Galois extension of Q with Galois group C2 x C2. 

2. Denote by a and r the generators of the two cyclic groups of (1), so that 
the Galois group of M is written (r) x (<t). 

• Compute cr(Q;^)/a^ and deduce that a ^ M. What is the degree of 
E over Q? 

• Extend a to an automorphism of E and show that this automorphism 
has order 4. 

• Similarly extend r to an automorphism of E and compute its order. 
What is the Galois group of E over Q? 



1. A//Q is clearly Galois because it is separable (Q is of characteristic 0) and 
normal. 

2. • Wehavecr(a2)/Q;2 = {^2-1^ thus a{a'^) = (a(\/2-l)2. If a were in 

M, then a{a) = ±a(v^-l) and a'^{a) = 0(^/2 - 1)(-\/2- 1) = -a, 
a contradiction (cr^(a) ~ a). 

• We have cr^(a) = —a thus a^{a) = a, a^\M ~ 1 and cr^ 7^ 1. 



3. As in previous exercise, with ^/3 instead of a/2. 




Answer. 
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• We compute that T{a) = ^ J^ ct, we extend r and its order is 4. We 
then compute that 

ar{a) ^ ^-^(^/2- l)a, r(a(a) ^ {V2 - l)^^a 

so (T and r anticommutc, and they are both of order 3, so the Galois 
group is the quaternion group. 

Exercise 101. Let L/K be a Galois extension of degree 8. We further assume 
that there exists a subextension Af / K of degree 4 which is not a Galois extension. 

• Show that the Galois group G of L/K cannot be abelian. 

• Determine the Galois group G of L/K. 
Answer. 

• A subextension M/K of degree 4 which is not Galois, means that there 
is a subgroup of order 2 which is not normal in G. Thus G cannot be 
abelian, since all subgroups of an abelian group are all normal. 

• The only groups of order 8 which are not abelian are and Qg. All the 
subgroups of Qs are normal, thus it must be D4. 

Exercise 102. Assume that the polynomial X"^ + aX^ + b E Q[X] is irreducible. 
Prove that its Galois group is: 

1. the Klein group if Vb e Q. 

2. the cyclic group of order 4 if \/ a? — AbVb G Q. 
Answer. 

1. Set Y = X^, then 

Y^ + aY + b={Y-yi){Y-y2) 

with 

-a + \Jo?- - 4b -a - \Ja?- ~ 4b 

2/1 = 2 = 2 

and X — ±-\/y so that the four roots are ±y^, ±y^. Now 2/12/2 — b 
thus 

and if cr(^2/i) ~ \/y2, then we have that 

c^iVlh) = ^/^{\/yi) = ^/ \/V2 = \f\r\ 

and all the elements of the Galois group have order 2, so that it must be 
the Klein group. 
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2. We have 
thus 

VyiVy2iyi - 2/2) = v/6 V - 46 e q. 

Now take cr{,yyi) = ^Jy2 and if it were of order 2, then cr(y^) = and 

which contradicts that ^Jyx^fyiijjx — y2) & Q thus cr is of order 4 and the 
Galois group must be the cychc group of order 4. 

8.3 Finite fields 

Exercise 103. Identify the finite fields Z\i]/{2 + i) and Z\i]/{7). 
Answer. F5 and F49 

Exercise 104. Consider the following two polynomials p(.t) = — 1 G V^lx] 

and q{x) = + 1 e ¥^[x]. Consider the fields ¥3[x]/{p{x)) ~ F3(a) where 
p{a) = and ¥3[x]/{q{x)) ~ F3(^) where q{f3) = 0. 

1. Compute (a+1)^. 

2. Deduce that the two fields F3(q;) and ^^{P) are isomorphic. 
Answer. 

1. We have (a + 1)^ = - a + 1 = (a + 1) - a + 1 = 2 = -1. 

2. We have that = — 1 by definition of /3 and we have just shown above 
that (of + l)^ = —1, thus it is natural to map /3 to a + 1, that is / : F3(/3) — >■ 
F3(a), a + b/3 a + b{a + 1). Check that / is a ring homomorphism. 
Then argue that a field homomorphism is always injective, and that both 
fields have same number of elements. 

Exercise 105. Let F2 be the finite field with two elements. 

1. Show that F2(/3) = F2[X]/(g(X)) is a finite field, where ^(X) =X^+X + 1 
and g(/3) = 0. 

2. Consider the polynomial r{Y) ^ V"^ + Y + /3 e F2(/3)[y], and set L = 
F2(/3)[y]/(r(y)). 

• Is L a field? Justify your answer. 

• What is the cardinality of L7 What is its characteristic? Justify your 
answers. 

Answer. 
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1. It is enough to show that q{X) is irreducible over F2, this generates the 
finite field F4 ~ F2(/3). 

2. • We have to see if r{Y) is irreducible over F4. It is enough to evaluate 

it in /? and /3 + 1 to see that it is not zero. 

• This creates an extension of degree 2 of F4. that is 16 elements. It 
has characteristic 2. 

Exercise 106. • Let Fp be a finite field, p > 3 a prime number. Show that 
the sum of all the elements of Fp is 0. 

• Let q = p^, p a. prime. Show that if g 7^ 2, then the sum of all elements of 
F, is 0. 

• Let q = p"' , p a, prime. Show that the product of all the non-zero elements 
of a finite field ¥q is -1. 

Answer. 

• There are many ways of doing that. Modulo p, one could simply notice 
that l + 2 + ...+p— 1 is p{p— l)/2, if p > 3, p is an odd prime, thus p— 1 
is even, {p — l)/2 is an integer and thus mod p we do get 0. 

• An element a in ¥q satisfies that a^" = a, that is, it is a root of A^" — X. 
Now all the roots of this polynomial exactly coincide with the elements of 
Fg, that is, we can write 

AP" -A = W (A -a). 

If we develop the product, we get that the term in A^""^ has as coefficients 
exactly the sum of the elements of F^, which is thus 0. 

• This follows from above. Now we just factor A from the polynomial 

AP" - A to get 

AP"-i - 1 = ]J (A-a). 

Now —1 corresponds to the constant term of the product, which is exactly 
the product over all non-zero elements of the finite field. 

Exercise 107. Consider the finite fields F2,F3 and F4, and the polynomial 
P{Y) — + Y + 1. Over which of these finite fields is P{Y) irreducible? If 
possible, construct the corresponding field extension. 

Answer. Since this polynomial is of degree 3, if it is reducible, that means 
there is at least one linear term, that is one root in the base field. It is thus 
irreducible over F2, however over F3, we have that P(l) = 0, and over F4, we 
have no root. Over F2, we get an extension of degree 3, that is Fg, over F4, we 
get an extension of degree 3, that is F43. 
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4.4 Cyclotomic fields 

Exercise 108. Let ^ be a primitive 20th root of unity in C, and let E = Q(C)- 

• Compute the Galois group Gal{E/Q). 

• How many subficlds of E are there which are quadratic extensions of Q? 

• Determine the irreducible polynomial of C over Q. 
Answer. 

• We know that Gal(£;/Q) ~ (Z/20Z)*. 

• There arc 3 of them: Q{iV5), Q(\/5) and Q{i). 

• It is - + - + 1. 

4.5 Solvability by radicals 

4.6 Solvability by ruler and compasses 

Exercise 109. True/False. 

Ql. An extension having Galois group of order 1 is normal. 
Answer. 

Ql. It's false! If there is only one element, then it's the identity. Again Q{a) 
with = 2 has a Galois group with only the identity, and it is not normal! 
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